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1  Introduction 


1.1  Research  Objectives 

The  objective  of  this  work  is  the  development  of  adaptive  control  methods  which  can  signif¬ 
icantly  improve  closed-loop  performance  for  a  broad  class  of  nonlinear  flexible  systems. 

Towards  this  end,  the  goals  were  as  follows: 

1.  Form  a  parametric  model  representing  a  broad  class  of  nonlinear  flexible  systems. 

2.  Design  a  nonlinear  controller  based  on  the  parametric  model  which  provides  desired 
closed-loop  performance  if  the  parameters  were  known. 

3.  Identify  the  parameters  of  the  system  using  measured  data  and  use  these  estimates  in 
the  controller. 

4.  Analyze  the  stability  and  performance  properties  of  the  complete  adaptive  system  using 
the  method  of  averaging. 

We  concentrated  most  of  our  effort  on  items  1  and  2  as  these  turned  out  to  be  the  most 
difficult  and  challenging  tasks.  These  must  first  be  resolved  in  order  to  proceed  to  adaptation 
(items  3  and  4). 

In  general,  the  final  adaptive  system  described  above  fits  into  the  generic  adaptive  scheme 
depicted  in  Figure  1. 


Estimated  Parameters 


Figure  1:  Adaptive  control  system  with  parameter  estimator. 

In  this  traditional  adaptive  control  system,  often  referred  to  as  the  self-tuning-regulator 
(STR),  [2],  the  identified  model  is  usually  selected  out  of  a  model  set  with  unknown  pa¬ 
rameters.  The  controller  is  designed  as  if  the  parameter  estimates  were  in  fact  the  correct 
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parameters  for  describing  the  plant.  In  the  ideal  case  it  is  assumed  that  there  exist  parame¬ 
ters,  which  if  known,  would  precisely  account  for  the  measured  data.  Even  in  this  ideal  case, 
the  transient  errors  between  the  identified  model  and  the  true  system  can  be  so  large  as  to 
completely  disrupt  performance.  In  the  usual  (non-ideal)  case  -  the  true  system  is  not  in 
the  model  set  -  both  unacceptable  transient  or  asymptotic  behavior  can  occur  e.g.,  [1]. 

The  above  cited  problems  are  well  documented  in  the  case  of  adaptive  linear  control, 
t.e.,  when  the  adaptive  parameters  are  held  fixed  the  closed-loop  system  is  linear.  However, 
the  case  considered  here  is  adaptive  nonlinear  control,  f.e.,  when  the  adaptive  parameters 
are  held  fixed  the  closed-loop  system  is  nonlinear.  Naturally  the  same  problems  will  arise 
in  this  case  as  in  the  adaptive  linear  case.  Unfortunately  many  of  the  controller  design  and 
parameter  estimator  issues  have  not  been  developed  sufficiently  as  yet.  In  consequence,  the 
first  step  has  been  to  resolve  some  of  these  issues. 


1.2  Publications 

See  Appendices  A  and  B  . 


1.3  Personnel 

Dr.  Robert  Kosut  and  Dr.  M.  Giintekin  Kabuli  worked  on  the  project  during  the  reporting 
period. 


1.4  Interactions 

We  have  had  several  interactions  with  Prof.  Crawley  and  students  at  MIT,  Prof.  Rock  of 
Stanford,  Prof.  Sastry  of  Berkeley,  and  with  Norm  Coleman’s  group  at  ARDEC. 
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2  Status  of  Research  Effort 


This  section  is  divided  into  subsections  each  corresponding  to  the  stated  goals  in  Section  1.1. 

2.1  Modeling  the  Plant 

2.1.1  Nonlinear  Feedback  Interconnection 

A  large  class  of  nonlinear  systems  can  be  modeled  as  depicted  in  the  following  block  diagram. 


u 

V 


Figure  2:  Plant  model. 

In  this  configuration  P{s)  is  a  transfer  function  matrix  representing  the  linear-time- 
invariant  (LTI)  part  of  the  plant.  The  nonlinear  part  is  represented  by  /(•),  a  memoryless 
nonlinear  function,  which  appears  as  a  feedback  interconnection.  The  variable  y  is  the  sensed 
output  and  u  is  the  control  signal  applied  to  the  actuator  inputs.  The  variables  z  and  v  are 
not  usually  measured,  although  some  of  them  may  be  contained  in  y.  This  model  is  capable 
of  representing  all  of  the  significant  nonlinearities  in  large  space  structures. 

The  plant  model  can  be  described  in  both  input-output  as  well  as  state  space  format. 
The  input-output  description  is 


y 

z 


*  n 

U 

Pll 

P12 

u 

V 

P2I 

P22 

V 

V  =  f{z) 

Assuming  P{s)  is  finite-dimensional,  the  state  space  format  is: 


X  =  Ax  +  Biu  -f  B2V 

y  =  C\X  -t"  D\\U  +  D12V 

z  =  C2X  -f-  D21U  +  £^22^ 


As  a  simple  example  of  the  model  structure  in  Figure  2,  consider  the  mechanical  system 
depicted  in  Figure  3. 
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This  system  represents  the  case  where  tortional  actuation  is  applied  to  a  load  through  a 
flexible  gear-train.  The  gearing  is  shown  to  occur  at  the  end  of  the  flexible  member,  although 
other  combinations  are  certainly  possible. 

Neglecting  any  electronic  dynamics,  and  assuming  that  the  flexible  rod  is  both  uniform 
and  damped,  the  motion  of  the  rigid  body  and  first  tortional  “mode”  for  small  angular 
deflections  can  be  approximated  by  the  system  of  differential  equations. 


Jm^i 

=  U -Y  D{x2  —  Xx)  +  K{x2  — 

Xi) 

Jg^2 

=  —V  —  D{x2—Xi)  —  K{x2 

-  Xx 

Jl^3 

=  Nv 

Z 

=  X2  —  Nx3 

V 

=  M 

y 

=  X3 

where  u  denotes  the  input  applied  torque,  (xi,X2,X3)  are  angular  deflections  as  indicated  in 
the  figure,  the  load  displacement  y  =  X3  is  the  sensed  output,  z  is  the  relative  gear  angle,  and 
/(•)  is  a  memory  less  nonlinearity  arising  from  backlash  in  the  gear  train  which  depends  on 
a  parameter  6.  The  constants  are  defined  as  follows:  J\f,  Jg^  and  Jl  are  the  motor,  motor 
gear,  and  load  inertias,  respectively,  N  is  the  gear  ratio  which  is  greater  than  one,  and  D,  K 
are  the  damping  and  stiffness,  respectively,  of  the  elastic  rod. 

In  this  example  the  transfer  matrix  P{s)  hcis  the  block  structure  described  before  with 

_ Ds^K _ 

s^[JmJgs^  +  {Jm  +  Jg){Ds  +  K)] 

N 

s^Jl 

_ Ds  +  K _ 

+  {Jm  +  Jg){Ds  -I-  /f)] 

_ Jms^  +  Ds  +  K _ 

3^[JmJgs^  +  {Jm  +  Jg){Ds  -t-  K)] 


the  scalar  elements: 

Pu 

P21 

P22 
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A  typical  backlash  nonlinearity  is  the  cubic, 


f{z)  =  bz^ 


(1) 


where  6  is  a  constant. 

Although  this  model  is  based  on  the  rotating  elastic  system  with  a  motor,  it  bears  great 
similarities  to  nonlinear  joint  eifects  in  LSS.  Professor  Crawley  and  colleagues  from  M.I.T. 
have  identified  models  of  this  type.  A  distinguishing  feature  of  this  type  of  model  is  the 
large  number  of  joints  which  need  to  be  accounted.  Referring  to  the  model  in  Figure  2, 
this  means  that  the  dimension  of  the  vectors  (z,v)  is  also  very  large,  that  is,  the  nonlinear 
vector  function  f{z)  has  a  dimension  essentially  equal  to  the  number  of  joints  with  significant 
nonlinearities.  In  addition,  the  linear  part  of  the  model,  P{s),  which  is  a  matrix  of  transfer 
functions,  has  a  large  number  of  states  corresponding  to  the  lightly  damped  flexible  modes 
of  the  structure. 


2.1.2  Model  Uncertainty  and  Adaptation 

In  our  initial  efforts  it  is  assumed  that  the  LTI  part  P{s)  is  known  and  the  nonlinear  function 
/(z)  depends  on  an  unknown  or  uncertain  parameters.  For  example,  /(z)  =  6z^  where  6  is 
an  unknown  parameter.  More  generally, 

/(z)  =  F{z)6 

where  G(z)  is  a  known  matrix  of  nonlinear  functions  and  6  is  an  unknown  parameter  vector. 

The  objective  of  adaptive  control  is  to  estimate  6  from  the  input-output  measurements 
{u,y)  and  then  adapt  the  controller  to  the  estimate.  The  first  step  then  is  to  devise  a 
nonlinear  controller  which  depends  on  the  parameter  9.  The  adaptive  controller  will  be 
identical  but  with  9  replaced  by  the  adaptive  estimate  9. 

After  forming  the  parameter  adaptive  algorithm  or  estimator,  the  plan  is  to  analyze  the 
complete  adaptive  system  using  the  classical  method  of  averaging  analysis  [3,  4].  Under 
previous  AFOSR  funding  we  refined  the  averaging  method  for  application  to  linear  flexible 
systems  [1].  In  a  feasibility  study  for  nonlinear  systems,  also  supported  by  AFOSR,  we 
used  the  Duffing  oscillator  to  represent  the  nonlinear  flexible  system  and  adjusted  a  single 
parameter  in  the  controller.  The  results,  reported  in  [9],  were  very  encouraging.  We  observed 
from  simulations  that  the  adaptive  system  performed  very  well,  and  achieved  an  almost 
optimum  final  tuning  of  the  controller  parameter.  Moreover,  we  discovered  that  the  resulting 
performance  was  directly  predictable  from  the  averaging  analysis,  and  this,  despite  the  fact 
that  the  system  states  passed  in  and  out  of  chaotic  and  multi-periodic  attractors. 

Before  all  this  can  be  accomplished,  we  must  first  be  able  to  control  the  nonlinear  plant 
when  the  parameters  are  known.  This  proved  to  be  a  major  challenge,  thus  limiting  our 
efforts  with  adaptation.  As  the  more  interesting  results  concern  the  controller  design,  we 
limit  the  remainder  of  the  report  to  this  part  of  the  effort. 
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2.2  Nonlinear  Controller  Design  via  Feedback  Linearization 


In  this  section  we  describe  our  efforts  to  develop  a  nonlinear  controller  based  on  the  nonlinear 
plant  model  just  described.  Our  first  assumption  is  that  the  plant  parameters  are  known, 
i.e.,  both  P{s)  and  /(•)  are  known.  Methods  for  direct  control  design  of  nonlinear  plants 
are  few  and  far  between  without  making  additional  assumptions.  One  such  method,  known 
as  feedback  linearization  has  received  considerable  attention  in  recent  years,  e.^i.,  [5].  This  is 
our  starting  point. 

The  basic  idea  is  to  find  a  nonlinear  feedback  system  which  transform  the  system  states 
to  a  new  set  of  coordinates  where  the  system  appears  linear. 

To  see  how  this  approach  works,  suppose  that  all  the  elements  of  P{s)  are  strictly  proper. 
Then,  the  state-space  description  of  the  plant  is  becomes: 

X  =  Ax  -t-  BiU  -1-  B2f{z) 

y  =  Cix 

z  =  C2X 


Ifdet(CiBi) 

control. 


^  0,  that  is,  the  inverse  of  C^Bi  exists,  then  it  is  easy  to  show  that  the  nonlinear 


U  =  UNL  —  {ClBx)  ^[UL  —  C\Ax  —  CiB2f{z)] 


yields 


y  =  UL 


An  interpretation  is  that  the  nonlinear  control  globally  linearizes  the  plant  system  from 
input  ul  to  output  y.  In  this  case  the  feedback  linearized  plant  is  a  single  integrator  which 
is  easy  to  control,  e.g.,  let 

UL  =  a{r  -  y) 

where  r  is  a  reference,  usually  a  set-point  or  constant.  Thus,  under  feedback  linearization 
the  output  obeys  the  differential  equation, 


y  =  a{r  -  y) 

Thus,  for  any  positive  constant  a,  if  r  is  a  constant,  y{t)  — >  r  exponentially  fast  as  t  — >  00. 
Moreover,  for  any  bounded  r,  y  is  also  bounded. 

There  are  some  difficulties  with  using  this  method: 

1.  Stability 

Although  the  t/-system  is  stable,  the  remaining  states  in  x  may  not  be  bounded. 

2.  Implementation 

The  nonlinear  feedback  u^l  depends  on  all  the  system  states,  i,  which  in  general,  are 
not  available. 
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3.  Robustness 

Feedback  linearization  requires  precise  knowledge  of  the  plant,  i.e., knowledge  of  P{s) 
and  /(•). 

To  illustrate  the  above  issues  we  make  the  further  simplification  that  y  =  z,  i.e.,  the 
nonlinear  function  acts  directly  on  the  measured  output.  Furthermore,  the  system  states 
can  be  partitioned  so  that 

In  this  case  the  state  description  of  the  plant  becomes, 

y  —  -^ny  +  -^laC  +  ^11^  +  Bi2  f{y) 

C  =  -^2x2/  +  ■^22C  +  ^21'^  +  ^22/(3/) 

The  feedback  linearizing  control  now  becomes, 

unl  =  -y)-  Any  -  A12C  -  B2if{y)] 

The  closed-loop  system  is  then, 

y  =  a(r  -  y) 

C  =  A2iy  +  B2iB{'^la(r  —  y)  ~  Any  —  B2i/(y)]  +  B22f{y)  +  {A22  —  B2iB^^ Ai2]C, 

2.2.1  Stability 

In  our  example,  it  is  obvious  that  bounded  r  implies  bounded  y.  Since  the  y-system  is 
decoupled  from  the  ^-system,  C,  will  also  be  bounded  if  and  only  if  the  eigenvalues  of 
■^22  ■“  ^2x5x1^ -^12  all  have  negative  real  parts.  Although  we  have  made  many  simplifying 
assumptions,  this  test  can  be  generalized  [5]. 

2.2.2  Implementation 

Since  Unl  depends  on  states  which  cannot  be  measured,  an  obvious  solution  is  to  reconstruct 
these  states  from  measurements  with  an  observer  network.  Thus,  an  implementable  feedback 
linearizing  control  is, 

y-NL  =  {CiBiY^[ul  —  CiAx  —  C\B2f{z)\ 

X  -  Ax Biu  -f-  B2f{y)  +  L{y  -  Cix) 

where  L  is  chosen  so  that  AlC  is  Hurwitz.  To  see  that  the  resulting  closed-  loop  system  is 
stable,  define  the  state  error 

-  dcf  - 

X  =  X  —  X 

Then, 

y  =  a{r  —  y)  +  CAx 
X  =  (A  —  LC)x 

Since  x{t)  — ♦  0  exponentially  fast,  the  system  is  stable. 
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2.2.3  Robustness 


Robustness  is  more  difficult  to  determine.  For  example,  suppose  that  f{z)  is  an  estimate  of 
f{z).  The  feedback  linearizing  control  now  becomes, 

iJ-NL  =  {CiBi)~^[ul  —  CiAx  —  CiB2f{z)] 

X  =  Ai  +  Biu  4-  B2f{y)  +  L{y  -  Cix) 

where  as  before  L  is  chosen  so  that  AjjC  is  Hurwitz.  Again,  it  is  convenient  to  introduce  the 
error  variables, 

-  def 

X  =  X  —  X 

-  dcf 

y  =  y-y 

where  y  is  the  ideal  output  which  satisfies, 

y  =  a{r  -  y) 

Then,  the  closed-loop  system  is: 

X  =  {A  —  LC)x  +  B2/ 
y  =  -ay  -f  CAx  +  CB2f 

f  =  fiy)  -  f{y)y  y  =  y  +  y 

When  /  =  /  we  return  to  the  previous  case. 

Although  theoretical  results,  such  as  the  Total  Stability  Theorem  [1],  indicate  that  the 
above  system  has  robustness  properties,  we  have  discovered  from  simulations  that  these 
systems  can  be  extremely  sensitive  to  even  small  perturbations  (see  Appendix  A)  .  This  has 
motivated  us  to  find  another  means  to  apply  feedback  linearization.  This  effort  is  described 
next. 


2.2.4  Effect  of  Feedforward  Control 

The  block  diagram  below  shows  a  controller  configuration  which  consists  of  a  feedforward 
signal  generator  and  an  inner  feedback  loop. 

Observe  that  the  feedforward  controller  copies  the  plant  using  the  estimate  /  in  place 
of  the  unknown  nonlinear  function  /.  The  feedforward  block  produces  two  signals,  y  and 
u.  To  generate  these  we  feedback  linearize  the  “model”  in  the  feedforward  signal  generator. 
Since  this  is  a  model,  all  states  are  available  and  feedback  linearization  is  ideal.  The  signal 
u  then  tries  to  undo  the  effect  of  the  unknown  nonlinearity  /  in  the  actual  plant.  Any  errors 
will  appear  in  the  error  signal  y  —  y  which  is  acted  upon  by  the  feedback  controller  K  in  the 
inner  loop.  This  controller,  hopefully  acting  on  small  errors,  can  most  likely  be  linear  and 
independent  of  errors  between  /  and  /. 
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Figure  4:  Nonlinear  feedforward  controller. 


Using  the  previous  example,  the  control  is  now: 
u  =  u  -\-u 

u  =  {CrB^)-\a(r-y)~CrAx-C^B^f{z)] 

X  =  Ax  +  Biu  +  B2f(y) 

y  =  a(r  ~  y) 

To  examine  the  closed-loop  system,  it  is  again  convenient  to  introduce  the  error  variables, 

.  def 

X  =  X  —  X 
-  def 

y  =  y-y 

Here  x  is  the  state  of  the  feedforward  signal  generator  rather  than  the  estimator  state.  Hence, 
X  is  bounded.  As  before  y  is  the  ideal  output.  Now  the  closed-loop  system  is: 

X  =  Ax  -f  B\u  +  ^2/ 

y  =  C-ix 

f  =  f{y)  -  f{y),  y  =  y  +  y 

This  system  is  very  similar  to  that  previously  developed  using  feedback  linearization  with 
an  observer.  However,  we  do  not  encounter  the  same  difficulties  with  this  system  in  the 
simulations  referred  to  in  Appendix  A  . 
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2.3  Towards  a  Nonlinear  Control  Design  Based  on  Measured 
Variables 


Within  the  context  of  finite-dimensional  linear  time-invariant  (LTI)  feedback  interconnec¬ 
tions  and  LTI  plants,  full-order  estimated  state-feedback  based  design  approaches  are  widely 
used  in  control  design  problems.  After  suitable  augmentations,  the  design  problem  is  reduced 


to  the  stabilization  of 


A  B 
C  0 


(can  be  taken  as  strictly  proper,  without  any  loss  of  gen¬ 


erality)  subject  to  a  particular  cost  criterion  (e.g.,  7^2  and/or  'Hoo)-  The  analytical  and/or 
numerical  solutions  to  the  associated  optimization  problem  yields  a  state-feedback  gain  K 
and  an  output-injection  gain  L  ,  where  the  matrices  {A  —  BK)  and  (A  —  LC)  are  both  strictly 


Hurwitz.  The  resulting  controller  is  given  by 


A-BK-LC  L 
-K  0 


The  relative  ease  of 


solutions  associated  with  such  design  problems  stems  from  the  inherent  decomposition  into 
two  sub-problems  which  can  be  dealt  with  separately.  The  estimation  of  the  state  and  the 
control  law  based  on  the  ideal  state  are  solved  separately,  and  then  the  estimated  state  is 
used  instead  of  the  true  plant  states  in  the  control  law.  This  two-step  approach  is  referred 
to  as  the  separation  principle. 


Before  proceeding  with  nonlinear  control  design  methods  and  why  the  separation  principle 
fails  in  general,  it  is  useful  to  observe  the  inherent  properties  that  make  the  separation 
principle  work  in  the  LTI  setting.  Consider  the  plant  P  :  u  y  , 


X  =  Ax  4-  Bu 
y  =  Cx 


with  a  stabilizable  pair  (A,B)  and  a  detectable  pair  (C,A)  .  If  the  plant  state  x  were 
available,  a  control  of  the  form  u  —  —Kx  would  solve  the  stabilization  problem.  Since 
X  is  not  available,  an  estimate  x  is  constructed  using  {u,y)  or  perturbed  versions  thereof. 
Consider  the  estimator  E  :  [ue,ye)  *— >  x  , 

X  =  (A  —  LC)x  -f-  Bue  +  Lye  , 


where 


Ue  +  du  —  u 
y  +  dy  =  ye 

where  the  exogenous  inputs  du  and  dy  denote  the  unmeasured  but  bounded  actuator  and 
sensor  disturbances,  respectively.  Letting  Cj.  ;=  x  — x  ,  and  using  the  estimates  in  the  control, 
\.e.,  Ue  =  —Kx  ,  the  closed-loop  system  is  described  by 

X  =  {A  —  BK)x  BKe^  +  Bd^ 
e'l  =  {A  —  LC)ex  +  Bd.a  —  Ldy 

The  properties  that  make  the  separation  principle  work,  can  be  listed  as  follows: 
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1.  For  bounded  (du,d„)  ,  is  bounded.  When  (du,dy)  =  0  ,  — >  0  . 

2.  For  bounded  estimation  error  e*  ,  x  is  bounded.  As  e*  — >  0  ,  x  — >  0  . 

Due  to  possibly  persistently  exciting  disturbance  ;  e*  is  not  expected  to  go  to 

zero.  Hence,  the  state-feedback  (in  terms  of  x)  must  be  robust  to  perturbations  Cx  which 
does  not  necessarily  correspond  to  initial  conditions  mismatches  in  P  and  E  . 

For  the  sake  of  argument,  suppose  that  one  indeed  has  an  asymptotic  observer  and  a 
“globally”  asymptotically  stabilizing  full-state  feedback  law.  Intuitively,  for  “small”  errors 
in  the  state-estimate,  the  separation  principle  based  control  design  will  be  stabilizing.  Thus, 
after  waiting  “sufficiently  long”  for  reliably  reconstructing  the  states,  one  can  use  the  esti¬ 
mated  states  instead  of  the  true  states  in  the  control.  Apart  from  the  nontrivial  assumption 
that  such  an  asymptotic  observer  is  available,  another  pitfall  in  such  an  argument  is  that 
one  may  not  be  able  to  wait  without  a  nominal  bounded-input  bounded-output  (BIBO)- 
stabilizing  loop,  since  the  plant  may  blow  up  before  estimation  errors  are  sufficiently  small, 
(e.g.,  X  =  +  u,  u  =  0,  x(0)  =  xo  >  0)  x(t)  =  )  .  The  existence  of  such  a  nominal 

stabilizing  loop  is  the  same  as  requiring  a  control  that  robustly  BIBO-stabilizes  the  plant, 
although  the  desired  performance  may  not  be  achieved. 

Imposing  the  algebraic  nonlinearities  in  the  ordinary  differential  equations  to  be  globally 
Lipschitz  in  order  to  extend  the  appealing  properties  of  linear  vector  fields,  may  be  quite 
restrictive.  Nonlinearities  in  one  coordinate  system  may  be  Lipschitz,  whereas  in  an  another 
coordinate  system  they  may  not  be.  Consider 

I'l  =  X2  +  /(xi) 

X2  =  X3 
X3  =  U  . 

Under  the  nonlinear  transformation 

^2  =  X2  +  /(xi) 

6  =  2:3  +  (12  + /(xi))/(^)(xi) 

the  input  to  state  map  can  be  expressed  in  terms  of 

6  =  6 

6  —  (3 

6  =  u  +  )  +  6/*'>to)  . 

' - ' 

«(0 


Even  if  /(•)  is  Lipschitz,  a(  )  is  not. 

For  a  given  nonlinear  plant  V  :  u  >-*  y  ,  consider  the  tracking  problem:  for  a  particular 
class  of  reference  signals  r  ,  determine  a  control 
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such  that  y  —*  r  and  for  a  reasonable  cla^s  of  bounded  disturbances,  under  the  perturbed 
control 

u  =  du  +  K{r,  y  +  dy) 

the  closed-loop  signals  remain  bounded.  In  other  words,  the  closed-loop  (see  Figure  5)  is 
BIBO-stable  (in  the  sense  of  a  particular  extended  space)  and  achieves  the  desired  tracking 
when  disturbances  are  zero. 


dmj 


Figure  5;  Desired  dynamic  controller  K. 


We  will  focus  on  a  particular  class  of  single-input  nonlinear  plants  which  admit  a  finite¬ 
dimensional  state-space  description  of  the  form 

X  =  f{x)  -f  g{x)u 

for  which  the  State-Space  Exact  Linearization  Problem  [5]  is  solvable  over  IR",  i.e.,  control¬ 
lability  and  involutivity  conditions  hold  in  IR"  .  For  such  plants,  when  states  are  available, 
the  origin  can  be  rendered  globally  asymptotically  stable.  Moreover,  when  i  is  available  for 
feedback,  the  asymptotic  tracking  problem  can  be  solved  for  any  output  that  does  not  intro¬ 
duce  a  nonminimum-phase  zero  dynamics.  In  the  case  where  the  output  y  to  track  r  does 
introduce  a  nonminimum-phase  zero  dynamics,  one  may  be  required  to  introduce  dynamic 
augmentation  prior  to  full-state  feedback.  Such  strong  global  results  do  rely  on  perfect  state 
measurement  and  render  controller  candidates  as  shown  in  Figure  6  . 


Figure  6:  Solvable  design  problem,  dynamic  controller  Cq;  di  =  0,  dy  =  0 

Taking  into  account  the  control  law  in  Figure  5  and  the  fact  that  one  typically  encounters 
number  of  sensors  much  less  than  the  number  of  states,  one  might  dismiss  the  plant  state 
based  control  laws  on  the  grounds  that  they  are  not  implementable.  As  it  is  motivated  by 
the  separation  principle  based  control  design,  for  classes  of  plants  that  admit  asymptotic 
observers  or  in  the  case  of  bounded  state-estimation  errors,  the  stability  properties  of  the 
control  law  with  y  =  x  in  Figure  5  is  of  major  importance  (see  also  Figure  8). 

Motivated  by  the  separation  principle  based  control  design,  a  design  approach  can  be 
taken  as  follows: 
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•  Assumption  ;  Imposing  stability  properties  on  {u,y)  is  equivalent  to  imposing  them 
on  X  .  The  plant  V  :  u  y  does  not  have  any  “hidden  modes”.  The  input  output  pair 
{u,y)  completely  characterizes  the  internal  dynamics. 

•  Problem  I  :  An  estimator  that  allows  the  decoupling  of  the  design  problem  (see 
Figure  7).  Recall  from  the  motivating  LTI  setting,  that  the  role  of  the  estimator  is  to 
guarantee  that  for  all  bounded  d^,  bounded  dy  and  possibly  unbounded  Ue  ,  the  error 
(x  —  x)  is  bounded.  Moreover,  when  =  0  and  dy  =  0  ,  ex  —*  0  .  In  other  words, 
for  the  interconnection  in  Figure  7  ,  the  error  Cx  must  be  uncontrollable  from  u  and 
the  bound  on  must  depend  only  on  the  bound  on  the  exogenous  variable  dy  .  This 
property  is  the  crux  of  the  separation  principle,  since  it  simplifies  the  design  problem 
to  Problem  II  . 


u 


a 


EST 

T 


'X 


Figure  7:  Problem  I;  estimator  with  (x  —  x)  G  Loo  for  all  u  in  Loo,e  and  dy  in  Lc 


•  Problem  II  :  A  stabilizing  state-feedback  law.  For  bounded  dx,  d^  and  dy,  determine 
a  controller  Ci  for  which  the  closed-loop  signals  remain  bounded.  Moreover,  when 
d„  =  0  and  dy  =  0  ,  as  dx  —*  Q  ,  y  —*  r  .  Note  that  this  step  is  nontrivial  since 
the  controller  Cq  in  Figure  6  is  typically  based  on  d*  =  0  and  dy  =  0  .  The  local 
stability  properties  of  the  design  in  Figure  6  may  not  suffice  for  the  specified  bound 
on  dx  obtained  from  Problem  I.  With  a  slight  abuse  of  notation,  from  now  on  we  will 
refer  to  C-i  in  Figure  8  as  a  robust  controller,  since  its  design  does  take  into  account 
the  predetermined  bounds  on  d*  and  dy  .  The  abuse  is  due  to  the  fact  that  in  the 
LTI  setting,  robustness  is  attributed  to  closed-loop  signal  dependent  perturbations, 
since  exogenous  additive  disturbances  cannot  drive  a  stable  LTI  loop  unstable.  Since 
local  and  global  stability  results  merge  in  the  LTI  setting  and  global  stability  results 
are  too  restrictive  in  the  nonlinear  setting,  perhaps  a  better  suited  description  for  the 
controller  Cj  in  Figure  8  is  semi-global  ;  i.e.,  for  the  given  bounds  on  du,dx  and  dy  the 
closed-loop  signals  remain  bounded. 


r 

C, 

n 

1*^ 

^  ’  d 

/*-V 

Figure  8:  Problem  II;  dynamic  controller  Ci,  possibly  different  from  Co 
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•  Merging  Solutions  to  Problems  I  and  II  :  Recall  that  the  solution  to  Problem  I 
guarantees  that  bounded  deviations  about  possibly  unbounded  input  output  pairs  of  V 
result  in  bounded  e*  .  Hence  one  can  incorporate  Figure  7  in  Figure  8  as  the  bounded 
disturbance  dx  generator  to  obtain  Figure  9  . 


Figure  9;  Candidate  controller  JC 


Note  that,  in  the  case  that  the  estimator  in  Problem  I  satisfies  the  property  that  e*  — »  0 
for  du  =  0  and  dy  =  0  ,  that  is  when  x  is  asymptotically  reconstructed,  the  choice  of  the 
robust  controller  Ci  in  Problem  II  guarantees  that  the  closed-loop  signals  remain  bounded 
due  to  bounded  dx  ;  in  addition,  the  nominal  design  constraint  on  Ci  ensures  that  y  — »  r  as 
dx^O  . 

As  a  preliminary  step  towards  control  design  based  on  state-estimates,  the  following  two 
studies  show  the  importance  of  the  choice  of  the  nominal  globally  stabilizing  (or  tracking) 
control  law  based  on  perfect  state  measurements. 

In  the  first  study,  set-point  asymptotic  tracking  problem  is  solved  for  an  output  that 
exhibits  nonminimum-phase  zero  dynamics.  Using  dynamic  augmentation  and  then  applying 
state-feedback  asymptotic  tracking  problem  is  solved.  In  other  words,  the  controller  Co  in 
Figure  6  is  obtained.  The  derivations  emphasize  that  the  standard  Lipschitz  constraints,  for 
ease  of  Lyapunov  based  derivations,  on  the  resulting  vector  fields  after  possible  changes  in 
coordinates  is  too  restrictive. 

In  the  second  study,  stability  properties  of  the  origin  is  investigated  when  the  feedback  law 
is  based  on  estimates  from  an  asymptotic  observer.  It  is  shown  that  the  globally  stabilizing 
exact  feedback  linearization  based  controller  Co  in  Figure  6  is  no  longer  globally  stabilizing 
in  Figure  7  .  A  globally  stabilizing  controller  Cx  that  solves  the  Problem  II  is  constructed. 
Using  the  asymptotic  observer,  the  controller  K,  in  Figure  9  is  obtained. 
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2.3.1  Example  1:  Tracking  and  Nonminimum-Phase  Zero  Dynamics 

In  order  to  illustrate  a  strictly  causal  plant  with  a  one-dimensional  possibly  nonminimum- 
phase  zero  dynamics,  consider  the  following  two-state  u  to  y  map. 

il  =  X2  +  /(xi) 

X2  —  ^X2  —  Xi)  +  U 

y  =  X2-  xi 

Note  that  when  /  =  0  ,  the  transfer  function  from  u  to  y  is  • 

It  is  easy  to  see  that  a  tracking  control  based  on  the  coordinate  transformation  using 
derivatives  of  y  will  not  be  stabilizing.  Note  that 

y  =  Ay  +  u-X2~  /(xi)  . 

For  r  =  0  , 

u  =  -Ay  -f  i2  +  /(xi)  +  Q!(r  -  y) 

results  in 

(y-r)  =  -a(y-r)  ; 

hence  for  any  a  >  0  ,  y  — »  r  .  However,  the  asymptotic  tracking  constraint 

X2  =  xi  -h  y(0)e"“‘ 

renders 

xi  =  xi -f- /(jci) -t- r -f  (y(0)  -  r)e"“*  . 

Take  for  example,  r  =  0,  /(■)  =  (•)^  .  Clearly,  in  the  limit 

I'l  =  Xi  -1-  Xj  , 

and  the  states  are  unbounded. 

When  /  is  zero,  it  is  well  known  that  step  inputs  can  be  asymptotically  tracked  if  and 
only  if  plant  and/or  the  compensator  have  at  least  one  pole  at  zero.  Since  does  not 

have  a  pole  at  zero,  the  standard  LTI  design  procedure  would  involve  augmentation  with 
an  integrator.  Proceed  with  the  same  augmentation  for  the  nonlinear  plant  at  hand  (see 
Figure  10)  . 


u 
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Figure  10:  Augmented  plant 

If  X  were  available,  since  rj  is  part  of  the  controller,  the  augmented  plant  states  are  avail¬ 
able.  If  one  can  find  a  relative  degree  three  output,  i.e.,  solve  the  state-feedback  linearization 
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problem,  then  ^  ♦  0  would  imply  that  y  r  .  This  requires  that  the  augmented  plant 

dynamics 

77  =  r  —  X2  +  xi 

=  2:2  +  /(xi) 

X2  —  4(x2  -  Xi)  -hu 

satisfies  the  controllability  and  involutivity  conditions.  Both  conditions  are  satisfied  provided 
that 

which  we  will  assume  for  global  results.  It  is  interesting  to  note  that  this  condition  is  in  fact 
a  generalization  to  the  hidden  mode  concept  in  the  LTI  setting,  i.e.,  if  /(xi)  =  — xi  ,  the 
linear  plant  from  u  to  y  would  have  a  hidden  unstable  mode. 

Now  consider  the  relative  degree  three  output 

z  =  Tj  Xi 

motivated  by  the  /  =  0  case.  Note  that 

=  r  +  xi  +  f{xi) 

2(2)  ^  (1  q.  + /(ii)) 

^(2:1) 

and  recall  that  the  (global)  controllability  condition  is  equivalent  to  /3(xi)  ^  0  for  all 

xj  G  ni  .  Hence,  under  the  coordinate  transformation 

(  = 

the  augmented  plant  dynamics  can  be  put  into  the  controllable  canonical  form 

^1  =  (2 

(2  =  (3 

6  =  ■  (X2  +  /)^  +^(xi)(u  +  4(x2  -  Xj)  +  •  (X2  + /)) 

' - V - - 

a(x)  +P{xi)u 

where  dependencies  on  xi  are  suppressed  in  the  /  terms,  for  the  sake  of  brevity.  For  any 
ip2  and  '03  such  that  the  polynomial 

+  02-®  d"  03 

is  strictly  Hurwitz,  the  control  u  determined  by 

a(x)  +  0{xi)u  =  03^1  -  026  -  0i6 
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renders  ^  =  0  globally  asymptotically  stable  (recall  that  the  transformation  to  (  coordinates 
is  one-to-one  and  onto)  .  Hence 


0  =  r  -I-  ii  -f  f{xi) 

0  =  P(xi)(x2  +  f(xi)) 


and  ^{xi)  0  imply  that 


r  -t-  ii  -  i2  — »  0 


i.e.,  y  r  .  Note  that  the  above  exact  cancellation  based  scheme  is  not  the  only  way  to 
stabilize  the  augmented  plant  in  the  ^  coordinates.  In  fact,  as  we  will  see  later  on,  such 
an  exact  cancellation  may  make  the  global  nature  of  the  nominal  result  not  robust,  i.e., 
perturbations  in  the  states  (other  than  initial  conditions)  may  render  the  result  only  locally 
stable.  Hence,  it  is  crucial  that  one  studies  the  sensitivity  of  the  full-state  baised  control 
design  and  modify  it  if  possible  so  that  for  the  class  of  anticipated  disturbances,  the  local 
nature  of  the  control  law  is  satisfactory. 


2.3.2  Making  the  Separation  Principle  Work 

Consider  the  plant 

X  =  Ax  +  Bu  -f-  F{y) 
y  =  Cx 

with  {A,B,C)  a  minimal  triple.  Suppose  that  there  exists  a  nonlinear  coordinate  transfor¬ 
mation 

^  =  Hx) 

such  that  the  plant  can  be  equivalently  represented  as 

i  =  A(  +  B{u  +  a{0) 

y  = 

For  such  a  system,  asymptotic  state  construction  is  easier  in  the  x-coordinates,  since 

X  =  Ax  +  Bu  +  F(y)  -1-  L{y  -  y) 
y  =  Cx 

yields  an  estimation  error  e*  ;=  x  —  x  ,  where 

e'x  =  {A  -  LC)ex  . 

The  stabilizing  control  is  easier  to  express  in  the  ^-coordinates  since 

u  =  -K(  -  a(0 

renders  ^  =  0  globally  asymptotically  stable  since  the  above  control  yields 

{  =  {A-BK)(  . 
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The  heuristic  step  of  replacing  (  by  $(2)  can  only  guarantee  local  results.  For  “sufficiently” 
small  Cx  ,  the  estimated-state  feedback  will  be  locally  stabilizing.  The  crucial  points  are  how 
large  the  basin-of-attraction  is  and  if  it  is  possible  to  enlarge  it  to  yield  a  satisfactory  design. 

Let  (  :=  $(x)  .  The  closed-loop  dynamics  under  the  feedback  law 

IX  =  -K(~a{0 


can  be  expressed  as 

f  =  (A-  BK)(  + 

e'x  =  {A  —  LC)ex 


dx 


(  LCex  +  F{C^-\()  +  ex))  -  F{C^-\i))  ) 


M{i) 


G(lex) 


Even  if  F  is  Lipschitz  (hence  ||G(|^,ex)|l  depends  only  on  |lexl|),  ^(i)  need  not  be  Lips- 
chitz.  Hence  establishing  and  modifying  the  basin-of-attraction  are  nontrivial  design  tasks. 
However,  the  crucial  observation  is  that  the  stabilization  problem  is  in  fact  the  robustness 
of  the  full-state  feedback  law  subject  to  L2  disturbances. 

The  following  example  illustrates  the  need  for  a  robust  state-feedback  law  and  the  sensi¬ 
tivity  of  the  exact-linearization  based  control  law. 


2.3.3  Example  2 

Consider  the  following  one  state  plant  model  V  :u*-^  y 

y  =  y^  +  u 

where  the  goal  is  to  render  y  =  0  globally  asymptotically  stable.  The  perfect  state  (namely 
the  plant  output)  is  not  available  due  to  an  unknown  output  disturbance  d,  where  d  E  L2 
and  d  E  Loo  •  The  fact  that  d  0  motivates  the  control  law  candidate 

u  -  -  ay 

with  a  >  0  .  We  show  that  such  an  exact  cancellation  based  control  law  yields  only  locally 
stable  results,  whereas  choosing  a  nonlinear  control  law  for  the  linearized  plant  significantly 
improves  the  results. 

The  motivation  behind  the  choice  of  d  above  is  possible  asymptotic  estimation  error 
subject  to  initial  condition  mismatches.  The  over-simplicity  of  this  one-state  plant  should 
not  be  misleading,  since  even  if  y(0)  is  known  perfectly,  the  estimator  built  for  plants  of  the 
form  X  =  Ax  +  Bu+F{y)  with  y  available,  will  not  necessarily  guarantee  zero  state-estimation 
error.  In  fact,  if  one  were  to  build  an  estimator 

y  =  u  -f  +  (y  -  y) 


18 


d 


Figure  11:  Control  under  perturbed  state  measurement 


and  base  the  control  law  on  y,  by  setting  d  —  y  —  y  ,  one  obtains 

d  =  -d  (2) 

which  implies  the  L2  disturbance  in  the  plant  states  (see  Figure  11). 

Consider  the  controller  of  the  form 

C(ot,P)  :  y^u  ,  u  =  -ay-^y^  . 

for  a  >  0  and  /3  >  1  and  the  disturbance  d  as  in  (2).  The  dynamics  of  the  resulting 
closed-loop  system  in  Figure  11  is  described  by 

y  =  -a{y d)  +  y^  - /S(y  +  df  (3) 

d  =  -d  (2) 


Without  getting  into  extensive  simulations  to  obtain  the  phase-portraits  for  different  a 
and  values,  it  is  useful  to  derive  some  qualitative  properties  of  (3)  and  (2)  analytically. 

For  a  >  0  and  ^  >  1  ,  (y,  d)  =  (0, 0)  is  the  only  equilibrium  point.  The  Jacobian  at  (0, 0) 
has  eigenvalues  —a  and  —1  ,  with  associated  eigenvectors  (y,0)  and  (y,  ^^y)  ,  respectively. 
Since  d{t)  cannot  change  sign,  the  phase  portrait  can  be  decomposed  into  three  invariant 
sets:  d  >  0  ,  d  =  0  and  d  <  0  ;  i.e.,  the  trajectories  remain  exclusively  in  the  set  that  the 
initial  condition  belongs  to.  There  is  an  odd  symmetry  in  the  flows  since  (y,d)  and  (— y,  — d) 
satisfy  the  same  differential  equations.  Hence,  it  suffices  to  consider  the  flow  for  d  >  0  since 
d  =  0  case  is  asymptotically  stable  for  all  y(0)  . 

The  purpose  of  the  exercise  is  the  choice  of  the  nonlinear  control  determined  by  /?  ; 
although  the  basin-of-attraction  will  be  affected  by  the  choice  of  a  ,  the  qualitative  properties 
for  /5  =  1  and  /3  >  I  are  generic.  In  the  following,  we  will  consider  the  case  where  a  =  1, 
(i.e.,  the  eigenvectors  are  colinear).  Let  the  Lyapunov  function  candidate  be  V"  =  ^{y^-\-cP)  . 
Evaluating  V  along  the  solutions  of  (3)  and  (2)  ,  we  obtain 

V  =  {I  ~  /3)y^  -  y^  -  (1  +  3^y^)d2  -  yd(l  +  3/3y^  +  pd^)  .  (4) 

From  (4),  one  can  easily  deduce  the  following: 

•  P  >  I  y  is  eventually  negative,  due  to  the  dominating  first  term  in  (4)  .  Hence,  y 
remains  bounded  for  any  (y(0),d(0))  .  Once  d  gets  sufficiently  small  ,  y  —y  0  . 
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•  /3  =  1  The  first  term  in  (4)  drops.  The  second  and  third  are  nonnegative.  The  last 
term  changes  sign  according  to  (j/,d)  in  the  first  or  second  quadrant.  Clearly,  for  any 
(i/(0),d(0))  in  the  first  quadrant,  the  volume  V  decreases,  however,  the  trajectories 
might  continue  into  the  second  quadrant,  where  the  last  term  becomes  positive.  We 
now  show  that,  finite  escape-time  behavior  is  indeed  expected  for  sufficiently  large 
d(0)  .  For  this  specific  case,  note  that  (3)  becomes 

y  —  —y  —  doe~^  —  3y^doe~^  —  3ydle~^^  —  .  (5) 


Considering  the  quadratic  term  above,  take  the  approximate  solution 


(t-a) 


about  the  approximate  escape-time  a  >  0  .  Substituting  (6)  in  (5),  we  obtain 


(t  —  ay  (t  —  a) 


doe-^-3j-^ 
—  a)  [t  —  ay 


doe-'  -  3^: - 


(<  - »)  “ 


Multiplying  both  sides  by  (t  —  a)^  and  evaluating  at  a,  we  obtain 


ae  = 


The  above  equation  predicts  possible  finite-escape  time  for  do  >  ^  .  In  fact  for 
3/(0)  =  —1  and  do  >  0.8  ,  simulations  indicate  finite  escape-time  behavior  in  the 
second  quadrant. 
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Appendix  A 


Appendix  A  includes  a  regular  paper  presented  at  the  31st  IEEE  Conference  on  Decision  and 
Control,  held  in  Tucson,  Arizona  on  16-18  December  1992  and  published  in  the  Proceedings 
of  the  31st  IEEE  CDC  pp.  251-256  .  Appendix  A  is  self-contained,  hence,  all  section,  equa¬ 
tion  and  figure  cross-referencing  within  these  pages  pertain  only  to  Appendix  A  .  An  abridged 
version  of  the  preliminary  stages  of  this  work  has  also  been  published  in  the  Proceedings  of 
the  1992  Yale  Workshop  on  Adaptive  and  Learning  Systems. 
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Abstract 

The  robustness  to  parameter  mismatch  in  feedback  linearization  based  nonlinear 
tracking  systems  is  investigated.  The  certainty-equivalence  principle  gives  rise  to  four 
possible  feedback  configurations:  One  is  the  widely  used  case,  where  the  states  are 
assumed  to  be  available;  two  others  are  observer-based;  the  last  one  is  a  model-follower 
based  on  a  feedforward/feedback  implementation  using  a  signal  generator.  It  is  noted 
that  the  unacceptable  transient  behaviour  of  the  adaptive  tracking  scheme  is  closely 
related  to  the  sensitivity  of  the  underlying  certainty-equivalence  based  control,  which  is 
analyzed  through  a  perturbation  approach.  Simulations  are  performed  on  an  example 
to  illustrate  the  points. 
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1  Introduction 


Consider  the  following  nonlinear  tracking  problem: 

•  For  a  given  nonlinear  plant  with  state-space  description 

X  =  f{x,u,e) 

6  =  0 

y  =  h(x,u,6)  , 

and  a  given  constant  reference  signal  r  ,  determine  a  feedback  law  u  =  C{y,r)  such 
that  the  closed-loop  system  is  internally  stable  and  the  output  y  asymptotically  tracks 
r  . 

The  parameter  6  is  clearly  uncontrollable.  The  desired  control  law  has  to  be  expressed  in 
terms  of  the  measured  plant  variables  ;  such  control  laws  will  be  classified  as  implementable  . 
The  goal  is  in  fact  a  robust  tracking  controller,  since  performance  is  achieved  despite  the 
uncertainty  in  the  uncontrollable  variable  6  . 

When  6  is  exactly  known,  the  tracking  problem  has  been  solved  for  special  classes  of 
plants  that  can  be  rendered  linear  under  algebraic  state-feedback  and  change  of  coordinates 
(e.g.,  [Isil]  and  Section  II  of  [Sasl])  . 

When  6  is  unknown,  the  widely  used  approach  to  design  an  adaptive  tracking  scheme  is 
the  certainty-equivalence  principle:  the  tracking  problem  is  solved  as  if  6  is  known  and  then 
the  above  control  is  based  on  the  estimate  6  (see  [Sasl]  and  the  references  therein)  . 

There  are  two  problems  with  the  above  approaches.  First,  the  states  must  all  be  sensed 
variables,  which  is  often  not  the  ca.se;  hence  the  controllers  are  not  implementable  in  our 
sense.  Secondly,  certainty-equivalence  based  designs  are  not  guaranteed  to  be  robust,  and 
hence,  the  adaptive  tracking  performance  may  be  unacceptable  due  to  poor  transient  be¬ 
haviour,  a  phenomenon  well  known  in  the  adaptive  linear  control  (e.g.,  [Andl])  . 

A  solution  to  the  implementability  problem  is  to  use  an  estimate  of  the  state  in  the 
controller.  Unfortunately,  there  is  no  dual  theory  of  observer  design  for  feedback  linearizable 
systems.  Adaptive  controllers  based  on  observers  have  been  obtained  in  the  literature  for 
specific  classes  of  nonlinear  plants  (e.g.,  under  output-matching  conditions  in  [Kanl])  . 

Addressing  the  second  problem  requires  designing  a  robust  nonlinear  tracking  controller 
before  any  adaptation  is  introduced.  Hence,  prior  information  on  the  plant  description  and 
the  parameter  uncertainty  are  crucial  to  improve  the  transient  behaviour. 

In  this  paper  we  consider  a  special  class  of  nonlinear  plants.  A  direct  interpretation  of 
the  certainty-equivalence  principle  gives  rise  to  four  possible  feedback  configurations. 

The  first  one  is  the  widely  used  case,  where  the  states  are  assumed  to  be  available.  De¬ 
spite  the  underlying  implementability  problem,  we  address  the  issue  of  sensitivity  through 
a  perturbation  analysis  of  the  approach.  Simulations  are  performed  on  an  example  to  illus¬ 
trate  the  points.  Beised  on  this  tracking  scheme,  we  derive  an  adaptive  tracking  controller. 


Performance  of  the  tracking  scheme  is  illustrated  in  an  example.  Although  the  responses 
are  bounded,  the  transients  are  very  large.  This  unacceptable  transient  behaviour  is  closely 
related  to  the  sensitivity  of  the  underlying  (non-adaptive  frozen  parameter  6)  certainty- 
equivalence  based  controller. 

The  other  three  approaches  are  all  implementable.  The  state  availability  a.ssumption  is 
dropped  at  the  expense  of  global  results.  Two  of  the  schemes  use  the  state-estimate  in  two 
different  coordinates.  The  third  is  a  new  model-follower  scheme.  An  example  system  is  used 
throughout  to  illustrate  the  system  behaviour  using  the  four  schemes. 


2  Plant  Description 

We  consider  a  class  of  single-input  single-output  nonlinear  plants  which  can  be  exactly  input- 
output  linearized  under  algebraic  state-feedback  and  change  of  coordinates.  The  class  under 
consideration  has  no  zero  dynamics.  Specifically,  we  assume  the  following: 

1.  The  plant  dynamics  has  the  state-space  description 

X  =  Ax  +  Bu fo{x)  +  Fi{x)6  ,  . 

T  V-*- J 

y  =  c  X 

where  x  6  ffP*  is  the  state,  u  G  IR  is  the  scalar  control  input,  y  G  IR  is  the  scalar  sensed 
output. 

2.  The  matrices  A  and  B  are  in  the  Brunowsky  canonical  form.  c^:=[100  ...  0]. 

3.  The  smooth  nonlinear  functions 

/o  :  IR"  IR"  and  Fi  ;  IR”  —>■  IR"’'*’  are  known  and  have  the  lower-triangular  forms 
=  fo{i){xi  ,X2,  ...  ,Xi)  and  Fi(..j)(x)  =  /i(i.»(xi  ,  X2  ,  . . .  ,  x.)  for  f  =  1, . . . ,  n  , 
j  =  l,...,p  . 

4.  0  G  IR*”  is  a  vector  of  unknown  constant  parameters. 


2.1  Remarks 


For  the  above  plant  description,  there  exists  a  possibly  parameter  dependent,  coordinate 
transformation,  (  =  $(x,0)  ,  where  $(-,0)  is  a  diffeomorphism  for  all  6  .  Under  the 
coordinate  transformation  $  ,  (1)  can  be  equivalently  described  by 

^  =  A^  +  B{u  +  go{x)  +  Gi{x)ip]  ^2) 

where  the  over-parametrization  ■0  consists  of  entries  of  appropriate  tensor  products  of  0  . 

The  class  of  plants  under  consideration  are  transformable  to  —  u  9o{x)  +  Gi{x)ip  ; 
hence,  they  are  state-dependent  perturbations  of  a  string  of  n-integrators. 
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For  notational  convenience,  we  will  refer  to  the  linear  part  in  (2),  i.e.,  the  triple  (/,  A,  B) 
as  the  linear  plant  P  .  Let 

T{x)  :=  go{x)  +  Gi{x)i}}  .  (3) 

This  class  of  plants  satisfies  the  parametric  strict-feedback  condition  in  [Kan2]  ;  hence, 
provided  that  the  state  x  is  available  for  feedback,  one  can  obtain  a  globally  adaptive  tracking 
system  for  the  plant  in  (1)  using  the  procedure  in  (Kan2]  . 

The  class  of  plants  under  consideration  include  the  so-called  benchmark  example  ,  where 
n  =  3,  p  =  1,  fo{x)  =  0  and  Fy{x)  ~  [/(^i)  0  0]^  ,  where  /(•)  is  not  necessarily  Lipschitz  in 

Xi  . 

We  will  denote  “state-x  is  available  for  feedback”  rather  than  “state  is  available  for 
feedback”  ,  in  order  to  emphasize  the  specific  coordinate  the  dynamics  is  described  in.  Note 
that  even  if  state-x  in  (1)  is  available,  state-^  is  not  available,  since  it  depends  on  a  6- 
dependent  transformation  where  6  is  unknown. 

Whenever  we  refer  to  a  stable  map,  the  map  is  causal  and  bounded-input  bounded-output 
stable  (defined  over  an  appropriate  extended  space).  With  a  slight  abuse  of  notation,  a  stable 
proper  transfer  function  h{s)  will  be  denoted  as  h  in  the  input-output  description  where 
hG{x)  will  denote  the  time-domain  waveform  obtained  by  convolving  the  impulse  response 
corresponding  to  h{s)  and  the  signal  G{x)  .  Parantheses  will  be  used  where  ambiguity  arises. 
Unless  specifically  emphasized,  factorizations  denote  proper  stable  factorizations. 


2.2  Example 


A  simple  third-order  example,  satisfying  the  conditions  in  Section  2,  is  used  to  illustrate 
certain  points  throughout  the  paper.  Consider 


ii  =  X2  -f-  6f{xi) 
X2  =  X3 
X3  =  u 
y  =  xi  . 


(4) 


Since  we  are  interested  in  an  input-output  approach,  all  initial  conditions  in  the  simulations 
are  assigned  as  zero. 

Candidates  for  the  function  /(•)  that  we  have  used  are; 


•  /(®i)  =  —  tanh(aii)  ,  motivated  by  a  simple  saturating  friction  nonlinearity.  Note 
that  although  /  is  Lipschitz,  the  corresponding  Gi  is  not. 

•  f(^i)  =  )  following  the  benchmark  problem  in  [Kan2]. 

•  f(xi)  =  -x\l{ax\  -f  1)  . 


The  function  /  used  in  a  particular  simulation  will  be  emphasized.  We  introduce  the  follow¬ 
ing,  for  future  reference. 
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$(l)  = 


11 

12  +  0f{xi) 

X3  +  Ox2f^'^\xi)  +  0^f(xi)f^^\xi) 


G^{x) 


Note  that  for  this  example, 


6  -  Wi) 


5fo(a:)  :=  0 

2x2f{xi)^\xi)  +  X2{f^^'>(xi)y 

.  fix,)(f(^\xr)y  +  P(x,)f(^)(x,)  J 

^:=[e  6^ 


G,{x)^  = 

=:  G,{i)6  , 


(5) 


which  emphasizes  the  issue  of  overpaxametrization  in  different  coordinates. 


Another  observation  which  becomes 
^  where 


useful  in  the  perturbation  description  is  the  map 


$ 


6-^/(6) 

.6-06^6). 


=  m  +  («-«) 


0 

/(6) 


(6) 


3  State-x  Available 

3.1  On  Sensitivity  of  State-x  Based  Stabilization 

The  contents  of  this  particular  subsection  focuses  on  a  slight  generalization  of  the  class  in 
Section  2  ,  where  the  input  u  need  not  be  scalar. 

Consider  the  input  u  to  state-i  map  described  by  (1)  or  equivalently,  by  (2)  .  Under 
a  change  of  coordinates  ($)  ,  the  input  u  to  state-^  map  can  be  rendered  linear  under  an 
algebraic  state-feedback  ^  (see  (3))  .  Hence  u  to  state-i  map  can  be  realized  as  in  the 
dashed-box  in  Figure  1  . 

The  map  from  u  to  ^  is  linear  (time-invariant  finite  dimensional).  Let  (  =  Pv  .  Since  P 
admits  coprime  factorizations,  P  =  ND~^  =  D~^N  with 
where  all  eight  maps  in  the  above  identity  are  stable. 


u 

V 

N 

-V 

-D 

N 

D 

U 
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The  nonlinear  map  $  is  stable  and  its  inverse  is  stable. 

The  algebraic  map  T  is  stable.  In  fact,  for  the  subsequent  derivations  in  this  subsection, 
$  and  T  need  not  be  algebraic. 


Figure  1;  Stabilizing  scheme  based  on  the  estimates  T  and  $ 
Note  that,  the  input  to  state-z  map  in  Figure  1  can  be  expressed  as 


N{D  -  N)-^  (7) 

=  (8) 

where 

+  {V){D-T^-^N)  =  I  (9) 

(M  -  V)  +  (NXU  +  =  i  (lo) 


Note  that,  (9)-(10)  imply  that  (7)-(8)  are  in  fact  nonlinear  right-  and  left-coprime  factor¬ 
izations  of  Vx  ,  respectively.  The  equations  (9)  and  (10)  also  describe  a  stabilization  scheme 
based  on  undoing  the  nonlinearities,  since  a  stabilizing  C  :  i  u  is  given  by 

C  =  -K^  -  T  , 

where  v  =  —K(  stabilizes  P  . 

In  order  to  incorporate  the  6  dependence  in  the  u  to  i  map,  consider  $  and  both 
functions  of  6  whereas  P  is  independent  of  6.  Note  that  such  an  tissumption  is  not  restrictive, 
since  typically  the  choice  of  coordinates  and  algebraic  state-feedback  is  constructed  to  render 
P  as  a  string  of  integrators  (in  each  channel).  From  now  on,  P  and  its  associated  terms  in 
the  Bezout-identity  will  be  considered  as  ^-independent.  The  9  dependence  in  $  and  T 
will  be  emphaisized  by  introducing  the  ""-versions  when  they’re  determined  by  the  parameter 
estimate  6  . 

We  now  can  express  the  standard  stabilization  scheme  based  on  certainty-equivalence 
approach  in  terms  of  the  maps  introduced  so  far.  Since  0  is  not  exactly  known,  the  control 
law  from  z  to  u  is  realized  as 

u  =  ,  (11) 
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where  denotes  the  desired  reference  in  terms  of  the  state-^  (see  Figure  1)  .  In  other  words, 
when  6  =  6,  the  tracking  performance  is  determined  by 

i  =  Pv 

V  =  K(ir-0  • 

Note  that  the  tracking  scheme  in  (11)  can  be  equivalently  represented  by  the  control  law, 

«  =  K((,  -  0  , 

where  K  is  a.  ^^-tracking  compensator  for  P  :  v  ^  ^  ,  and  v  ,  ^  are  as  shown  in  Figure  2  . 


Figure  2:  Perturbation  of  P  :  v  k-»  ^  under  6  mismatch 

Note  that  the  stable  feedback  perturbation  in  Figure  2  can  be  expressed  as  {T  —  , 

since  x  =  $“^(0  is  available,  although  $  and  ^  are  not.  As  long  as  the  feedback  law 
from  ^  to  u  is  chosen  as  a  linear  time-invariant  K  stabilizing  P  =  ND~^  ,  one  obtains  the 
precise  conditions  under  which  the  certainty-equivalence  bcised  control  law  can  withstand  6 
perturbations.  Before  stating  the  conditions,  we  will  adopt  the  following  definition. 

•  Let  K  stabilize  P  (i.e.,  K  =  V~^U  and  P  =  ND~^  ,  where  UN+VD  =  I)  ■  K  is  said  to 
stabilize  the  perturbed  plant  in  Figure  2  iff  the  control  law  v  =  U2-\-  K{u\  —  ^  yields  a 
stable  (ui,'U2)  z  ,  where  z  denotes  the  pseudo-state  in  Figure  2  (see  also  Figure  1)  . 


With  this  definition,  one  can  work  out  the  pseudo-state  equation  to  derive  the  necessary 
and  sufficient  condition  for  K  to  robustly  stabilize  the  perturbed  u  to  ^  map,  i.e.. 


{J  + 


\  V  u] 

■  -  .P)$-‘  ■ 

L  J 

N) 


(12) 


has  a  stable  inverse.  Note  that  P  can  be  stabilized  using  only  its  first  output,  i.e.,  there 
exist  U  =:  [  0  0  . . .  0  ]  ,  such  that  U N  +  V D  =  I  .  Hence  using  a  dynamic  feedback 

compensator  of  the  form  K  =:{  C  0  0  ...  OIjC  stabilizes  P  ;  moreover,  the  map  in  (12) 
further  simplifies  to  (  /  -|-  V{T  —  )  since  —  7)  =  0  . 

Typical  robustness  results  (small-gain,  passivity  based  sufficient  conditions)  can  be  uti¬ 
lized  together  with  assumptions  on  the  nonlinear  post-multiplicative  and  feedback  pertur¬ 
bations  in  (12)  (such  as  sector  bounded  nonlinearities,  linear  or  multilinear  6  dependence 
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etc.)  to  generate  classes  of  systems  for  which  (12)  can  be  justified.  However,  we  are  not 
interested  in  further  restricting  the  set  of  plants  under  investigation  for  the  sake  of  forcing 
some  sufficient  conditions.  For  the  simple  example  in  Section  2.2  ,  even  if  the  nonlinearity 
/(•)  is  chosen  to  be  globally  Lipschitz,  the  same  is  no  longer  true  for  T  . 


3.2  Sensitivity  to  ee  :  Simulation 


The  example  in  Section  2.2  with  f(xi)  =  — xj  is  used  for  the  following  simulation. 

Recall  that,  since  (\  =  Xi  —  y  ,  from  (6)  ,  —  7)  =  0  .  Hence,  by  choosing  a 

dynamic  compensator  C  from  y  to  u  ,  the  error  introduced  by  the  output  multiplicative  term 
in  Figure  2  is  avoided. 

The  compensator  C  :  y  v  is  chosen  as  an  'H2-o^iim&\  compensator  for  1/s^  for  a 
particular  choice  of  weights.  Throughout  the  simulations,  <^(5)  =  >+9^^'?9^z4.35°33l3*'^6  6297  • 


The  sensitivity  of  the  closed-loop  io  eg  6  ~  6  is  illustrated  as  follows:  The  parameter 
9  is  chosen  as  0  =  1  .  For  a  unit-step  reference  input  the  parameter  estimate  6  is  perturbed 
about  the  nominal  value  1  .  When  6  =  6,  the  closed-loop  system  exhibits  the  desired 
tracking  response  of  a  linear  system.  When  6  is  perturbed  to  1.0025  and  1.005  ,  the  tracking 
performance  of  the  loop  is  shown  in  Figure  3  . 


Figure  3:  Sensitivity  of  certainty-equivalence  based  design  using  state-i  for  0  —  \  ,  6  = 
1  ,  1.0025  ,  1.005  . 

Clearly,  within  0.5%  relative  error,  the  asymptotic  tracking  property  is  lost  and  the 
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system  goes  unstable.  An  adaptation  law  ba.sed  on  this  sensitive  closed-loop  is  bound  to 
exhibit  unsatisfactory  transient  behaviour  even  if  asymptotic  tracking  is  achieved  by  param¬ 
eter  adaptation.  Note  that,  Figure  3  shows  the  closed-loop  responses  for  frozen  6  values. 
Hence,  one  should  definitely  avoid  very  slow  adaptation. 


3.3  Adaptation 

For  the  plant  description  in  Section  2  ,  provided  that  the  state-i  is  also  available  for  exact 
linearization,  one  can  bring  an  input-output  approach  to  a  particular  case  of  certainty- 
equivalence  based  adaptive  control  design. 

We  now  outline  the  design  procedure; 

1.  For  the  class  of  plants  in  Section  2  ,  determine  go  and  Gi  ,  reducing  zero  columns  of 
Gi  (if  any)  to  cut  down  r  inecessary  overparametrization  in  tj)  . 

2.  Design  a  stable  stabiiiz  ig  compensator  C(s)  for  1/s"  ;  i.e.,  C  =:  Uc/dc,  1/s"  =:  nj,ldp 
and  TipTic  dpdc  ~  1  . 

3.  An  adaptive  tracking  control  law  candidate  is  determined  by  applying  the  control 

u  =  C{r  -y)-  goix)  -  {Gix)tP  ,  (13) 

where  the  parameter  estimate  is  updated  using  the  error  equation 

y  -  npHcT  =  dcnp{(Gix)e,i,)  ,  (14) 

so  that  y  — »  UpU^r  .  (e^  =  V’  —  V’  and  r  denotes  a  reference  signal  that  tracks  a  step 
input.) 

Note  that  {y  —  npUt-r)  in  (14)  denotes  the  error  between  the  plant  output  and  the  ideal 
tracking  performance  (if  the  nonlinear  cancellations  were  exact)  . 

Consider  the  nonlinear  plant  described  in  Section  2  shown  in  Figure  1.  For  the  scalar 
input  case,  let  D  =  dp  . 

The  pseudo-state  equations  can  be  written  as 

dpZ  —  u  -h  go{x]  -h  Gi{x)ijj 
Nz  =  ( 

y  =  (i  , 

where  N  and  dp  are  stable;  c^N  =:  Up  and  (rip,  dp)  is  a  right  coprime  factorization  of  1/s"  . 
Since  Up  is  minimum-phase,  ripd"^  is  strongly  stabilizable.  Hence  there  exist  n^idc  with  d~^ 
stable,  satisfying  UcUp  dcdp  =  1  . 
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Since  N  is  strictly  proper,  Q  is  strictly  causal,  where 

g  =  Go  +  igi{-)H 

GqZ  =  gox  (15) 

Qiz  =  GiX  .  (16) 

Hence, 

dpZ  =  u  +  Qz 

describes  a  well-posed  u  to  z  map.  Since  $  and  are  stable  maps  and  {rip, dp)  is  a 
right-coprime  pair,  the  plant  can  be  equivalently  represented  by  the  following  pseudo-state 
equation 

dpZ  =  u-\-  Qz 
TipZ  =  y  . 

Note  that  although  z  is  not  available,  GqZ  and  QiZ  are  available  since  state-i  is  available 

((15), (16))  . 

Hence  the  plant  description  from  u  to  y  can  be  expressed  as 

y  ~  Up  {dp-  gy'  u  . 

Now  apply  the  certainty  equivalence  based  control  law 

7t 

u  =  U2  +  -j-{ui  -y)-  Goz  -  (^iz)V’ 

O-c 

where  ui  and  denote  the  exogenous  additive  inputs  at  the  plant  input  and  output  in  the 
standard  unity-feedback  system,  respectively. 

For  U2  =  0  ,  ui  =  r  ,  where  r  denotes  the  desired  reference,  we  obtain 

y  -  ripTi^r  =  <icnp((^i2)e^)  . 

Since  dcUp  is  minimum-phase,  one  can  utilize  an  augmented  error  scheme  to  update  e^  . 

In  order  to  cut  down  on  the  number  of  states  introduced  by  the  filters  in  the  augmented 
error  scheme,  we  use  the  filtered  error  Cy  , 

Cy  :=  dy{y-npncr)  .  (17) 

Since  Tip  is  minimum-phase,  one  can  adapt  ^  using  the  error  form 

e„  =  np((^iz)e^)  .  (18) 

Since  {dcdp){0)  =  0  ,  when  cancellation  is  exact,  step  inputs  can  be  asymptotically 
tracked. 

We  now  derive  the  update  law,  based  on  the  error  equation  in  (18)  .  Since  it  utilizes  the 
standard  augmented  error  scheme,  we  briefly  outline  the  steps. 
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Since  rip  is  minimum-phase  and  strictly  proper,  it  can  always  be  expressed  as  rip  =:  /11/12  , 
where  hi  is  strictly  positive  real,  strictly  proper  and  A2  is  proper  stable.  Note  that  hi  = 
and  h2  =  {s  l)np  will  always  work,  since  rip  is  strictly  proper.  However,  to  cut  down  on 
the  number  of  states  introduced  by  filtering  by  ^2  in  the  augmented  error  implementation, 
it  is  advisable  to  factor  out  a  strictly  positive  real  factor  of  rip,  instead. 

For  ease  of  notation,  let  W  :=  QiZ  .  Then  (18)  can  be  written  as  =  hih2{W e^)  .  After 
adding,  subtracting  terms  and  using  the  fact  that  V*  =  0  ,  we  obtain 

7,  =  hi{{h2W)e^)  , 

where  rj  :=  Cy  —  hi{{h2W)^  —  /i2(kF^))  •  Since  hi  is  strictly  positive  real,  the  update  law 

^  =  'y{h2W)'^r]  ,  7  >  0 

guarantees  that  7/  E  £2  FI  Coo  ^^nd  E  Coo  ■  Provided  that  the  closed-loop  signals  are 

bounded,  W  E  Coo  ;  hence,  rj  0  ,  •ij}  0  and  e^  — >  0  .  This  concludes  that  set  point 
tracking  is  achieved  in  the  limit  provided  that  the  closed-loop  signals  remain  bounded. 

3.3.1  Adaptive  Tracking  Scheme  :  Simulation 

The  adaptation  scheme  above  is  applied  to  the  example  in  Section  2.2  for  two  choices  of 
/  .  In  both  adaptive  schemes,  the  closed-loop  signals  remain  bounded;  hence  asymptotic 
tracking  is  achieved.  However,  the  transient  responses  are  unacceptable.  Note  that  in  both 
cases,  even  without  adaptation,  state-i  based  certainty-equivalence  control  performance  was 
extremely  sensitive  to  constant  parameter  errors. 

•  /(^i)  —  +  1)  )  =  0-05  ,  7  =  10  ,  reference  r  =  I  ,  9  —  1.  This  function 

approximates  f{xi)  ~  —x\  reasonably  for  |xi|  <  2  .  The  tracking  performance  is 
shown  in  Figure  4. 

•  f{xi)  =  —  tanh(axi)  ,^  =  — l,a  =  4,7  =  l,  reference  r  =  0.1  .  The  tracking 
performance  is  shown  in  Figure  5. 

4  On  Implementable  Certainty-Equivalence  Based 
Controllers 

Recall  that  the  example  in  Section  2.2  can  be  expressed  in  x  or  ^  coordinates  as  in  (1)  and 
(2),  respectively.  Note  also  that,  due  to  (5),  there  is  no  need  for  overparametrization  in  the 
^  coordinates. 

If  the  parameter  vector  9  and  either  the  state  x  or  ^  are  known,  then  either  the  control 

u  =  v-9GiU)  (19) 
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Figure  4:  Asymptotic  tracking  of  r  =  1  from  zero  initial  conditions 


or,  in  terms  of  the  state  x,  the  control 

u  =  V  —  Gi{x)'>p  (20) 

provides  exact  input-output  linearization.  That  is,  the  system  v  >—y  y  is  linear  time-invariant, 
where 

(  =  A(-{-Bv 
y  =  Cl 

Let  C{s)  be  the  same  stabilizing  compensator  (for  (c^.  A,  B))  as  in  Section  3.2  .  Hence  v  = 
Gif  —  y)  ,  where  r  denotes  the  desired  reference,  achieves  the  desired  tracking  performance 
when  6  =  9. 

Since  the  parameter  vector  6  and  neither  state  x  nor  ^  are  known,  it  is  natural  to  replace 
them  in  (19)  and  (20)  with  estimates  6,  x,  and  C-  This  is  the  certainty  equivalence  principle. 
As  it  does  make  a  difference  which  observer  state  is  used,  we  consider  them  separately.  As 
in  Section  3.2  ,  the  following  simulations  are  based  on  f{xi)  =  — Xj  . 

4.1  Observer  in  the  State-x 

The  control  law  is 

X  =  {A  -  Lc'^)x Bu -i- Fi{x)9 Ly 

u  =  V  —  G\{x)ij}  ,  ^  ' 

where  (A  —  L<F)  is  strictly  Hurwitz.  Since  all  simulations  are  performed  with  zero  initial 
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Figure  5:  Asymptotic  tracking  of  r  =  0.1  from  zero  initial  conditions 

conditions,  provided  that  6  —  6  ,  the  nominal  performance  is  identical  to  that  of  the  linear 
equivalent. 

The  sensitivity  of  the  closed-loop  to  es  6  ~  9  is  illustrated  as  follows:  The  parameter 
9  is  chosen  as  5  =  1  .  For  a  unit-step  reference  input  the  parameter  estimate  9  is  perturbed 
about  the  nominal  value  1  .  When  9  is  perturbed  to  1.1  and  1.2  ,  the  tracking  performance 
of  the  loop  is  shown  in  Figure  6  . 


4.1  Observer  in  the  State-^ 


The  control  law  is 

t  =  iA-JJ)i+B[u  +  9Gr{0]  +  Ly  ,  . 

u  =  v-9Gi{t)  , 

where  {A  —  Lc^)  is  strictly  Hurwitz.  Since  all  simulations  are  performed  with  zero  initial 
conditions,  provided  that  9  =  9,  the  nominal  performance  is  identical  to  that  of  the  linear 
equivalent. 
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Figure  6;  Sensitivity  of  certainty-equivalence  based  design  using  the  estimate  of  state-x  for  0  =  1  , 

?=  1  ,  1.1  ,  1.2  . 

4.2  Model-Follower 

This  feedback-feedforward  control  scheme  is  based  on  a  locally  stable  unity-feedback  system. 
For  the  given  nonlinear  plant  P  :  tt  i/  ,  let  C-p{s)  locally  stabilize  the  plant  in  the  unity- 
feedback  system  about  the  reference  r  .  A  model  is  used  to  generate  the  feedforward  signals. 
The  model  is  a  copy  of  the  plant  description,  where  0  is  replaced  by  6  .  Since  all  states  are 
available  for  this  model,  exact  linearization  can  be  performed.  Let  C{s)  be  the  compensator 
that  is  being  used  so  far  in  the  previous  three  certainty-equivalence  based  designs.  Subscript- 
m  denotes  the  model  variables. 

The  control  law  is  (see  Figure  8)  : 

G{r  -y^)  (23) 

+  Gvivm  —  y)  • 

For  the  specific  example,  C-p  =  C  achieved  local  stabilization  about  the  unit-step  reference. 
6  is  perturbed  to  1.25,  1.5,  1.75  ,  2  ,  2.5  ,  2.75  and  3  ;  the  tracking  performance  of  the  loop 
is  shown  in  Figure  9  .  Note  that  this  scheme  exhibits  the  least  sensitive  tracking  design. 


= 

3/m  — 

— 

— 
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Figure  7:  Sensitivity  of  certainty-equivalence  based  design  using  the  estimate  of  state-^  for  0  =  1  , 
0  =  1  ,  1.25  ,  1.5  ,  1.75  ,  2  . 

5  Concluding  Remarks 

An  input-output  approach  has  yielded  some  insight  into  robustness  of  feedback  linearizable 
schemes  under  parameter  mismatch.  Our  simulations  of  adaptive  nonlinear  systems  show 
a  phenomenon  familiar  in  adaptive  linear  systems.  Namely,  even  though  the  system  is 
theoretically  globally  stable,  the  transient  can  be  quite  large.  The  problem  is  inherent  in  the 
parameter  sensitivity  of  the  system  when  the  adaptation  is  frozen.  The  observer-based  and 
model-follower  schemes  are  most  likely  more  robust  to  parameter  changes  then  the  full-state 
available  scheme,  which  in  general,  is  not  implementable  unless  the  states  are  measured. 


Figure  8:  Model-follower  described  by  (23),  where  C{vm,(m)  ■=  Vm  -  ■ 
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Figure  9;  Sensitivity  of  the  model-follower  based  certainty-equivalence  design  for  0  =  1  ,  0  = 
1  ,  1.25  ,  1.5  ,  1.75  ,  2  ,  2.5  ,  2.75  ,  3  . 
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Appendix  B 


Appendix  B  includes  a  regular  paper  submitted  for  possible  presentation  at  the  1993  Au¬ 
tomatic  Control  Conference  to  be  held  in  San  Francisco,  California.  Appendix  B  is  self- 
contained;  hence,  all  section,  equation  and  figure  cross-referencing  within  these  pages  pertain 
only  to  Appendix  B  . 
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Abstract 

An  input-output  approach  is  used  to  investigate  stabilization  of  a  class  of  nonlin¬ 
ear  plants  that  can  be  rendered  linear  time-invariant  under  nonlinear  stable  dynamic 
feedback.  Results  are  obt^ned  for  the  standard  unity-feedback  configuration  subject 
to  bounded  disturbances.  It  is  shown  that  the  design  method  based  on  applying  the 
inverse  transformation  on  any  stabilizing  controller  for  the  transformed  linear  time- 
invariant  plant,  does  not  necessarily  yield  a  stabilizing  controller  for  the  nonlinear 
plant.  Conditions  under  which  the  method  is  justified  are  derived. 
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1  Introduction 


Due  to  the  relative  ease  of  solving  control  problems  associated  with  finite-dimensional  linear 
time-invariant  (LTI)  plants,  the  class  of  nonlinear  plants  that  can  be  rendered  LTI  by  al¬ 
gebraic  state-feedback  has  received  considerable  attention  in  the  literature  (e.g.,  [Isil,  Sasl, 
Kanl]  and  references  therein)  .  This  transformation  based  approach  provides  one  of  the 
few  systematic  means  of  designing  nonlinear  control  laws  and  has  been  applied  to  numer¬ 
ous  research  problems  involving  stabilization,  tracking,  adaptive  stabilization  and  adaptive 
tracking.  While  most  of  these  applications  might  be  valid  nominal  design  approaches,  the 
stability  of  resulting  closed-loop  systems  subject  to  plant  uncertainties  and/or  disturbance 
models  has  not  received  comparable  attention. 

Towards  the  goal  of  establishing  robust  control  design  methodologies  for  classes  of  nonlin¬ 
ear  plants,  a  crucial  step  is  guaranteeing  stability  when  the  system  is  subject  to  persistently 
exciting  bounded  disturbances  and  determining  bounds  on  such  disturbances  for  which  sta¬ 
bility  is  guaranteed.  In  other  words,  design  methods  and  the  subsequent  analyses  must  be 
able  to  justify  stability  (be  it  local  or  global)  in  the  presence  of  sensor  and/or  actuator  noise, 
even  before  bringing  in  more  demanding  robustness  to  plant  uncertainties. 

In  this  paper,  motivated  by  the  class  of  nonlinear  plants  that  can  be  rendered  LTI  under 
nonlinear  algebraic  state-feedback,  we  focus  on  a  particular  class  of  nonlinear  plants  that 
can  be  rendered  LTI  under  nonlinear  stable,  possibly  dynamic,  feedback.  We  study  the 
stabilization  of  this  particular  class  of  nonlinear  plants.  The  study  is  based  on  an  input- 
output  framework,  where  the  closed-loop  results  are  stated  for  the  standard  unity-feedback 
configuration  subject  to  bounded  actuator  and  sensor  disturbances  [Desl]  . 

The  results  are  organized  as  follows.  The  relationships  between  the  graphs  of  the  nonlin¬ 
ear  plant  and  its  transformed  LTI  counterpart  are  derived;  the  fact  that  the  bounded  input- 
output  pairs  of  the  two  maps  are  related  by  stable  maps  motivates  the  generally  adopted 
design  procedure  based  on  undoing  the  nonlinearities,  which  consists  of  the  following  steps: 

1.  transform  the  nonlinear  plant  to  an  LTI  one, 

2.  design  a  stabilizing  controller  (possibly  nonlinear)  for  the  LTI  plant  and 

3.  using  the  inverse  transformation  and  the  designed  controller  for  the  LTI  plant,  deter¬ 
mine  the  transformed  controller  for  the  nonlinear  plant. 

It  is  shown  that  this  straightforward  three  step  procedure  is  not  a  unified  stabilizing  control 
design  method  for  the  class  of  plants  at  hand  when  the  plant  is  subject  to  bounded  actua¬ 
tor/sensor  disturbances  .  In  other  words,  given  any  member  of  the  particular  class  of  plants 
under  study,  taking  the  three  steps  mentioned  above  does  not  necessarily  guarantee  a  sta¬ 
bilizing  controller.  As  might  be  expected,  the  problem  arises  due  to  the  second  step  of  the 
design  procedure,  since  it  inherently  assumes  that  the  design  for  the  transformed  LTI  plant 
and  the  nonlinear  plant  are  separate  problems.  In  other  words,  any  stabilizing  controller 
for  the  LTI  plant  may  not  yield  a  stabilizing  controller  in  the  third  step.  Subsequent  results 
formalize  this  issue  and  state  conditions  under  which  such  a  design  approach  is  justified. 
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While  the  notion  of  incremental  stability,  which  generalizes  the  properties  of  linear  stable 
maps,  can  be  easily  imposed  to  guarantee  sufficient  conditions  for  the  method  to  work  in 
general,  the  extreme  conservatism  in  expecting  any  controller  in  the  second  design  step  to 
work  is  shown  by  establishing  that  incremental  stability  has  to  be  necessary  ,  as  well.  Unless 
such  restrictions  happen  to  be  satisfied  for  the  specific  problem  at  hand,  all  of  these  cau¬ 
tionary  results  imply  the  following:  The  design  problem  for  the  transformed  LTI  plant  need 
not  be  decoupled  from  the  original  nonlinear  design  problem;  hence,  one  should  not  rule  out 
intentionally  nonlinear  controller  design  for  the  transformed  LTI  plant.  Although  this  might 
seemingly  defy  the  purpose  of  the  transformation,  since  the  transformed  design  problem  is 
yet  another  nonlinear  control  design  problem,  it  also  emphasizes  the  need  for  more  results 
in  nonlinear  control  design  for  LTI  plants,  before  nonlinear  plants  are  transformed  to  linear 
ones.  Moreover,  despite  the  global  linearizing  transformation,  one  might  end  up  with  only 
a  locally  stable  interconnection.  Simple  examples  with  analytical  derivations  are  used  to 
emphasize  the  points. 


1.1  Notation  and  Preliminaries 

•  All  nonlinear  maps  in  this  study  are  causal,  multi-input  multi-output  and  defined 
over  appropriate  products  of  extended  Loo[0,  oo)  spaces.  For  a  thorough  treatment  of 
general  extended  spaces  within  the  input-output  approach  to  nonlinear  systems,  see 
e.g.,  [Desl]  .  For  notational  convenience,  for  functions  from  1R+  to  IR"  ,  the  associated 
set  of  bounded  signals  and  the  extended  space  will  ve  denoted  by  and  , 
respectively. 

While  most  of  the  observations  in  this  study  do  generalize  to  other  extended  spaces,  the 
particular  choice  of  sup  bounded  functions  on  IR4.  is  motivated  by  studying  the  stability 
properties  of  nonlinear  interconnections  subject  to  persistently  exciting  disturbances. 

An  n^-input  rio-output  causal  nonlinear  map  V  will  be  considered  as 

V  -.u  C  ^  , 

where  U  denotes  the  domain.  The  extended  space  is  a  means  of  incorporating  un¬ 
bounded  signals  in  the  study;  however,  one  must  note  that  although  Loo  C  Looe  , 
(I'oo)^  \  Looe  7^  0  I  where  (•)^  denotes  the  complement  of  the  set  with  respect  to  the 
set  of  all  functions  on  IR.f,  .  The  nonempty  intersection  arises  due  to  discontinuities 
which  are  not  jump-discontinuities.  The  scope  of  the  extended  spaces  do  not  cover 
signals  which  exhibit  “finite  escape  time”;  hence  the  plant  description  over  a  strictly 
proper  subset  of  the  input  extended  space  might  be  necessary.  Hence,  describes  the 
set  of  bounded  signals  and  Looe  \Loo  denotes  the  set  of  unbounded  signals  (unbounded 
at  infinity). 

•  Calligraphic  capital  letters  will  be  used  to  denote  nonlinear  maps.  Italic  capital  let¬ 
ters  will  denote  linear  time- invariant  maps  that  admit  finite-dimensional  state-space 
descriptions;  for  this  class,  with  a  slight  abuse  of  notation,  the  map  and  its  associated 
transfer  function  representation  will  be  used  interchangeably.  In  the  case  that  italic 
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letters  are  used  for  nonlinear  algebraic  maps,  parantheses  will  be  included  to  emphasize 
the  evaluation  (i.e.,  Ax  vs.  F(x))  .  For  two  nonlinear  causal  maps  J-  and  Q  the  map 
will  denote  the  composition  of  the  two  maps. 

•  In  an  input-output  approach  to  the  analysis  and  design  of  nonlinear  interconnections, 
the  notions  of  boundedness  and  stability  are  crucial  for  subsequent  results.  Unlike  the 
finite-dimensional  linear  time-invariant  case,  most  of  these  properties  depend  on  the 
particular  framework.  The  following  four  definitions  set  up  the  particular  framework  in 
this  paper.  For  a  treatment  of  related  topics  in  fractional  representations  of  nonlinear 
casual  maps  and  stability  of  nonlinear  interconnections  see  e.g.,  [Haml,  Verl,  Vidl, 
Kabl]  and  references  therein. 

•  The  set  of  all  input  output  pairs  of  a  given  map  does  admit  a  special  partitioning  which 
exhibits  particular  properties. 

Definition  1  (graph  of  V,  ^("P)) 

For  a  given  map  V  :  U  C  ^  ,  the  graph  of  V  is  denoted  by  Q{V)  ,  where 

g{V)  =  U  X  V{U)  . 

Moreover,  Q['P)  admits  the  following  unique  partitioning 

g{V)  =  Ub^  {V{U))g  U  ZYfl  X  {V{U))^  U  Wt;  X  {V{U))b  \J  UvX  (P(Z^))c/  . 

%.  ■  -  — .  ^  .11  V  I  ^  ...-i  /  ^  ^  s  I.I  I  —I  / 

Qbb{V)  Qbu{V)  Qub{-P)  QuuiT) 

where  the  subscripts  B  and  U  denote  the  bounded  and  unbounded  components,  re¬ 
spectively, 
o 

In  the  bounded  and  unbounded  partitioning  of  g{V)  ,  denotes  the  desired  i/o 

pairs,  describes  the  instabilites  of  the  plant  (“unstable  poles”)  and  ^(/b(P) 

describes  the  instabilites  in  the  inverse  relation  (“unstable  zeros”)  .  Clearly,  a  feedback 
system  stabilizes  V  subject  to  bounded  disturbances  if  and  only  if  the  closed-loop 
generates  plant  i/o  pairs  in  ^bb(P)  •  In  other  words,  ^bb(P)  must  be  nonempty.  For 
example,  for  the  particular  map  V  from  u  to  y  where 

y  =  1  -f- 

has  ^Bfl(P)  =  0  )  hence  there  cannot  exist  any  stabilizing  scheme. 

•  Unless  specifically  emphasized,  maps  are  not  considered  to  be  stable  or  algebraic. 

Definition  2  (stable) 

A  causal  map  H  —*  is  said  to  be  stable  iff  there  exists  a  continuous 

nondecreasing  •  IR-i-  IR-i-  snch  that  \\Hu\\  <  (/iB(||ti||)  for  all  u  G  . 
o 
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For  a  stable  map  Ti  , 

-  QBBiH)  =  is  X  W(iS)  , 

-  QbuO^)  =  0  , 

“  Qub{J~(-)  niay  or  may  not  be 

Definition  3  (unimodular) 

A  causal  map 
is  stable, 
o 

For  a  unimodular  map  Ti  , 

-  GBB(n)  =  IS  X  . 

~  QbuO^)  —  0  , 

-  QuB{n)  =  0 , 

-  Quu{n)  =  (IS.\IS)  X  (IS.\IS) , 

Definition  4  (incrementally  stable) 

A  causal  map  'H  :  LSe  I^e  i®  incrementally  stable  iff  H  is  stable  and 

there  exists  a  continuous  nondecreasing  '■  IH-+  — >  1R-+  such  that  for  all  u  6  L^e  > 
\\'H{u v)  -  HuW  <  ^wdlufO  for  all  v  €  IS  • 
o 

For  an  incrementally  stable  map,  bounded  deviations  in  the  input  result  in  bounded 
deviations  at  the  output.  The  bound  on  the  output  deviation  is  independent  of  the 
nominal  input  signal  u  . 

In  the  case  of  an  algebraic  incrementally  stable  map  Ti.  where  the  bounding  function 
is  linear  ,  the  algebraic  nonlinearity  is  also  referred  to  as  Lipschitz  continuous. 

•  From  now  on,  the  words  bounded  and  stable  will  be  interpreted  in  the  sense  of  Loo  ■ 

•  The  nonlinear  unity-feedback  system  S{V,C)  denotes  the  interconnection 

y  —  Vu 

u  =  ui  -  C{u2  +  y)  , 

where  Ui  and  ua  denote  the  exogenous  inputs  perturbing  the  actuator  and  sensor  signals 
u  and  y  ,  respectively  (see  Figure  1)  . 

•  The  feedback  system  S{V,C)  is  said  to  be  stable  iff  the  map  from  (tii,U2)  to  {u,y)  is 
stable. 

Note  that  the  stability  of  S{V,C)  requires  that  the  closed-loop  map  exists  and  it  is 
stable.  The  well-posedness  condition  that  ensures  the  existence  of  the  map  is  almost 
always  satisfied  in  practice,  since  V  and/or  C  are  strictly  causal. 


empty. 


is  said  to  be  unimodular  iff  Ti.  is  stable,  bijective  and 
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2  A  Class  of  Feedback  Linearizable  Plants 


Definition  5  (£) 

A  causal  nonlinear  plant  "P  is  said  to  belong  to  the  class  C  iff  there  exist  a  unimodular  map 
Ad,  a  linear  time-invariant  map  P  with  a  finite-dimensional  state-space  representation,  and 
a  stable  map  T  such  that 


(1) 


In  order  to  emphasize  the  particular  triple  (Ai,  P,P)  used,  equation  (1)  will  be  denoted  by 


V  =  C(M,P,A) 


o 


Note  that  if  P  =  C{Al,  P,J-)  (see  Figure  1) 

V  :  u  ^  y  i  ^ 


,  the  map  from  u  to  y  can  be  expressed  as 

=  Pv 
=  u  +  Ty 


Applying  the  stable  output-feedback 


u  =  V  ~  J-y 


the  map  from  v  to  Aiy  is  rendered  linear.  Clearly,  for  a  given  P  G  £  ,  the  triple  (Ad,  P,  P) 
is  not  uniquely  determined.  The  well-posedness  of  the  feedback  loop  in  the  description  of  P 
is  inherently  assumed  by  the  existence  of  the  map  from  u  to  y  ;  in  general,  strict  causality 
of  P  and/or  P  suffice  for  well-posedness  of  the  feedback  loop  in  C(Ai,  P,  P)  ■ 

This  particular  class  of  nonlinear  plants  in  Definition  5  is  in  fact  motivated  by  the  special 
class  of  nonlinear  plants  that  can  be  put  in  the  controllable  canonical  form 

X  =  /(i)  -1-  Bu 

C  =  $(x) 

c  =  +  B(ii  +  f (i))  , 


where  $  is  an  algebraic  change  of  coordinates  and  F  is  an  algebraic  stable  map.  Note  that 
for  this  particular  case,  y  =  x  ,  Aiy  =  ^(y),  Py  =  P(y)  ^-nd  P  =  (si  —  A)~^B  . 

For  plants  in  £  ,  the  linear  part  P  admits  coprime  factorizations;  let 


P  =  JVD-^  =  D-^N 

with  the  Bezout  identity 


u 

V 

■  N 

-V  ' 

-D 

N 

D 

U 

(2) 

(3) 
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where  all  of  the  eight  maps  are  stable.  By  linearity,  stable  maps  are  also  incrementally 
stable.  Note  also  that  for  the  description  in  (2)  for  a  full  rank  P  ,  the  numerators  drop 
rank  at  the  zeros  and  the  denominators  drop  rank  at  the  poles.  Since  P  is  not  necessarily 
single-input  single-output,  poles  and  zeros  (except  blocking  zeros)  may  coincide.  Since  it  is 
easier  to  describe  the  unstable  zeros  and  unstable  poles  of  P  due  to  the  transform  algebra, 
one  can  easily  describe  the  partitions  Qub{P)  and  Qbu{P)  ■  The  following  result  establishes 
the  connection  between  the  graphs  of  P  and  V  ^  C  . 

Fact  6  (on  graphs  of  P  and  V) 

iiV  —  £(Ad,P, .F)  ,  then  there  exists  a  unimodular  map  T  such  that 

1.  Qbb{V)  =  rGBB{P) . 

2.  Qub{p)  =  rguB{P) . 

Proof:  See  Appendix, 
o 


Note  that  Fact  6  establishes  that  the  desired  i/o  pairs  and  the  unstable  dynamics  of  the 
inverse  relations  are  related  in  the  graphs  of  P  and  V  .  Such  a  simple  description  between 
the  rest  of  graph  partitions  cannot  be  established  in  general. 

We  now  consider  the  stabilization  of  plants  in  C  ;  i.e.,  determining  a  causal  nonlinear 
map  C  such  that  the  feedback  system  S{V,C)  (see  Figure  1)  is  stable. 


Figure  1:  Feedback  system  S{V,C)  . 

The  existence  of  a  transformation  of  P  to  a  linear  P  ,  and  Fact  6  suggests  stabilization 
schemes  based  on  designing  controllers  for  P  .  The  following  conjectures  emphasize  this 
design  approach. 

Conjecture  7 

For  a  given  V  =  £(Ad,P, p)  ,  if  S{P,Cp)  is  stable  then  S{V,(CpAi  -|-  p))  is  stable. 

O 
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Conjecture  7  is  based  on  the  design  method  of  undoing  the  nonlinear  maps  J-  and  M  , 
which  consists  of  the  following  steps: 

1.  transform  the  nonlinear  plant  to  an  LTI  one, 

2.  design  a  stabilizing  controller  (possibly  nonlinear)  for  the  LTI  plant  and 

3.  using  the  inverse  transformation  and  the  designed  controller  for  the  LTI  plant,  determine 
the  transformed  controller  for  the  nonlinear  plant. 

Conjecture  8 

For  a  given  V  =  P,J-)  ,  if  S{V,C)  is  stable  then  S(P,(C  —  is  stable, 

o 


If  Conjectures  7  and  8  were  in  fact  true,  one  would  end  up  with  a  necessary  and  sufficient 
condition  for  stabilizing  plants  in  C  by  merely  using  the  transformations  and  nonlinear 
controller  design  methods  based  on  linear  plants.  We  now  investigate  these  conjectures  and 
state  conditions  (other  than  trivial  cases  like  V  =  P  =  £(/,  P,  0))  ,  under  which  related 
results  can  be  established. 


3  Stabilizing  Feedback  Linearizable  Plants 

The  following  fact  (see  e.g.,  [Kabl])  describes  the  set  of  all  nonlinear  stabilizing  compensators 

Cp  in  S{P,Cp)  . 

Fact  9  (5(P)) 

Let  the  causal  linear  map  P  satisfy  equations  (2)  and  (3)  .  Under  these  assumptions,  the 
feedback  system  S{P,Cp)  is  stable  if  and  only  if  Cp  G  S{P)  ,  where 

S(P)  =  {(U  +  DQ){V  -  NQ)-^  I  Q  is  stable  and 

(y  —  NQ)  is  bijective  with  a  causal  inverse} 

The  following  Lemma  justifies  the  design  approach  in  Conjecture  7  for  a  special  subset 
of  plants  in  C  . 

Lemma  10  (A  Sufficient  Condition  for  Stabilizing  C(M,  P,J^)) 

Let  V  =  jC(A4,  P,y^)  ,  where  the  maps  p"  and  Af  are  both  incrementally  stable.  Under  these 
assumptions,  if  S(P,Cp)  is  stable  then  S{V,{CpM  -f  A))  is  stable. 

Proof:  See  Appendix. 

<0 


The  existence  of  a  unimodular  map  that  relates  the  desired  input  output  pairs  of  P  and 
V  (see  Fact  6-1)  ,  is  the  main  motivator  for  Conjectures  7  and  8  .  The  second  part  of  Fact  6 
relates  the  “unstable  zero  dynamics”  of  P  and  P  .  For  a  precise  description  of  the  notion 
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of  zero  dynamics  in  nonlinear  systems  that  extend  the  well-known  case  in  finite- dimensional 
linear  time-invariant  maps,  see  [Isil]  .  Unlike  the  motivating  controllable  canonical  form 
derivation,  the  class  of  nonlinear  plants  in  C  can  have  “unstable  zero  dynamics”  .  In  fact. 
Lemma  10  does  apply  to  such  cases  since  one  can  easily  stabilize  P  with  unstable  zeros.  The 
advantage  of  the  class  C  is  to  take  the  stabilization  concept  of  algebraic-feedback  lineariz- 
able  systems  under  full-state  feedback  (which  relies  on  constructing  an  output  that  has  the 
sufficient  relative  degree  to  avoid  “zero  dynamics”)  to  a  larger  class  where  the  input  output 
map  already  has  “unstable  zero  dynamics”  .  The  virtue  of  Fact  6-2  is  that  one  does  not  need 
to  determine  the  stability  properties  of  the  “zero  dynamics”  of  the  nonlinear  map  V  ,  which 
is  a  highly  nontrivial  task. 

We  now  present  a  counterexample  to  Conjecture  7  . 

Example  11 

Consider  the  single  state  plant  model  V  from  u  to  y  described  by 

y  =  +  u  . 

Note  that  V  =  .  Choose  Cp  =  I  €  S{P)  .  For  C  =  CpM  +  P  =  I  +  {-f  ,  the 

closedloop  system  S{V,C)  is  described  by 

y  =  -(2/ +  “2)  +  -  (t/ +  +  til  • 

Clearly,  for  U2  =  0  ,  the  closed-loop  map  from  u\  to  (u,y)  in  S{V,C)  is  stable.  However, 
for  U2  ^  0  ,  the  closed-loop  system  5(P,C)  is  not  stable.  To  see  this,  consider  ui  =  61  and 
U2  =  S2  where  6i  =  0  ,  i  =  1,2  .  In  other  words,  under  constant  sensor  and  actuator  bias, 
the  closed-loop  system  is  described  by 

y  =  ~(1  +  2^2  )y  +  61  —  62  —  6l 

Hence,  for  S2  <  —0.5  ,  the  output  y  is  not  bounded, 
o 


Example  11  emphasizes  the  importance  of  checking  the  closed-loop  map  from  (ui,U2) 
to  {u,y)  ,  rather  than  just  ui  to  (u,y)  .  The  same  example  also  illustrates  the  effect  of 
estimated  state  feedback  and  how  the  lack  of  a  separation  principle  for  nonlinear  systems 
renders  the  design  procedure  heuristic,  which  happens  to  fail  for  this  particular  case.  When 
an  estimator  with  steady-state  bounded  bias  is  introduced  in  the  ideal  state-feedback,  the 
closed-loop  is  no  longer  stable. 

The  problems  with  Conjectures  7  and  8  are  mainly  due  to  ignoring  the  exogenous  input 
U2  ,  and  relying  on  the  following  fact. 

Fact  12  (Conjectures  7  and  8  under  no  measurement  noise) 

For  a  given  P  =  C{M,  P,P)  , 

5(P,Cp)|y^_Q  is  stable  <=>  +  •^)lu3=o  is  stable  . 

o 
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The  instability  mechanism  in  Example  11  can  also  be  used  to  establish  the  lack  of  perfor¬ 
mance  robustness  under  finite-energy  disturbances,  where  Ui  and  U2  —*  0  •  Given  any  bound 
on  the  amplitude  of  y  ,  there  exists  a  finite- duration  disturbance  (hence  finite-energy)  such 
that  the  closed-loop  output  exceeds  this  bound  before  asymptotically  reaching  zero. 

We  now  state  the  main  result  that  shows  the  conservatism  of  basing  the  stabilization  of 
V  on  the  stabilization  of  P  .  The  following  theorem  establishes  that  Conjecture  7  is  not  true 
in  general  and  that  incremental  stability  is  of  paramount  importance. 

Theorem  13  (Conservatism  of  Conjecture  7) 

Let  V  —  C{M.,  PjT)  with  Ad  incrementally  stable.  Under  these  assumptions, 

S{V  ,  (CpAd  +  T))  is  stable  for  all  Cp  G  S(P)  ,  if  and  only  if  P"  is  incrementally  stable. 
Proof:  See  Appendix. 


Note  that  Theorem  13  emphasizes  that  the  design  procedure  is  no  longer  decoupled 
(namely  first  stabilize  P  and  then  apply  the  transformed  controller  to  P)  if  the  incremental 
stability  of  p"  fails.  When  incremental  stability  fails.  Theorem  13  does  not  imply  that  there 
does  not  exist  a  compensator  for  P  for  which  Conjecture  7  is  valid.  On  the  contrary,  the 
selection  of  Cp  is  crucial  as  emphasized  in  the  following  example. 

Example  14 

Consider  the  plant  P  in  Example  11  .  Choose  Cp  =  7  +  (•)^  .  Note  that  Cp  €  S(l/s)  (see 
Appendix)  .  Apply  the  heuristic  design  procedure  in  Conjecture  7  ,  i.e.,  choose  C  =  CpAd  -f- 
T  .  The  closed-loop  system  5(P,  7  -|-  (•)^  -|-  (  j^)  is  stable  (see  Appendix)  . 


In  order  to  show  how  restrictive  it  is  to  impose  Ad  and  T  are  both  incrementally  stable, 
consider  the  following  example. 

Example  15 

Consider  the  map  V  from  u  to  y  described  by 

=  X2  +  f{xi) 

X2  =  X3 


where  the  algebraic  nonlinearity  /  is  bounded,  i.e.,  there  exists  a  fc  >  0  such  that  |/(x)|  <  k 
for  all  X  6  IR  .  Clearly,  /(•)  ,  is  stable  and  incrementally  stable.  Moreover,  for  any  linear  time- 
invariant  finite-dimensional  C  such  that  5(1/5^[1  s  l^jC)  is  stable,  the  interconnection 
S{V,  C)  is  stable.  Now  consider  the  plant  V  in  terms  of  the  canonical  form  C  .  Provided 
that  /  is  at  least  twice  differentiable,  under  the  coordinate  transformation 


^2  +  f{xi) 

X3  +  X2f^^\xi)  +  /(ii)/('^(xi) 


and  algebraic  feedback 

Tx  =  -t-  Xzf^^\xi)  +  2x2f{xi)J^^\xi)  + 

x2U»K^,)f  +  /M(f<‘>(x,)r  + 

one  has 

,^)  ■ 

Clearly,  Ad  is  unimodular  and  is  stable.  However,  unless  vanishes,  is  not  incremen¬ 
tally  stable, 
o 


V  -  C{M,- 


s 

o2 


4  Conclusion 

In  the  course  of  bounded-input  bounded-ouput  stabilization  of  nonlinear  plants,  those  that 
can  be  transformed  into  an  LTI  one  are  of  particular  interest  due  to  a  vast  pool  of  theoretical 
and  computational  tools  associated  with  LTI  design  problems.  The  existing  transformation 
immediately  suggests  a  simple  design  approach:  design  a  stabilizing  control  law  for  the 
transformed  LTI  plant  and  then  use  the  transformed  control  law  for  the  nonlinear  plant. 
As  it  is  shown  in  the  study,  such  an  approach  is  not  a  sure  design  procedure  unless  more 
restrictions  are  imposed  on  the  transformation.  The  design  for  the  transformed  LTI  plant 
is  really  not  decoupled  from  the  original  nonlinear  design  problem;  hence,  one  should  not 
rule  out  intentionally  nonlinear  controller  design  for  the  transformed  LTI  plant.  Although 
this  might  seemingly  defy  the  purpose  of  the  transformation,  since  the  transformed  design 
problem  is  yet  another  nonlinear  control  design  problem,  it  also  emphasizes  the  need  for  more 
results  in  nonlinear  control  design  for  LTI  plants.  Along  this  line,  for  a  given  LTI  plant,  one 
has  a  complete  parametrization  of  all  nonlinear  stabilizing  controllers  ;  however,  the  choice 
of  the  parameter  subject  to  a  particular  cost  is  not  straightforward.  Another  approach  is 
based  on  optimal  control  with  non-quadratic  integrand  cost  criteria  from  which  nonlinear 
state-feedback  laws  are  derived. 


5  Appendix 

Proof  of  Fact  6 

Let  V  =  £(A4,  PjJ-)  .  From  Figure  1  ,  the  equations  relating  the  (u,t/)  pair  and  {v,  Pv) 
pair  define  the  map  T  as  follows: 

T{v,Pv)  =  {u,y)  =  (v  —  Pv,  Pv) 

{v,Pv)  =  T”^(u,y)  =  {u  +  J^y,My)  . 

Since  T  is  stable  and  Ad  is  unimodular,  T  defined  above  is  also  unimodular.  By  unimodu¬ 
larity  of  T  , 

(u,y)  e  ^bb('P)  {v,Pv)  ^  Qbb{P)  , 
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which  establishes  Fact  6  1). 

Let  (u,  y)  6  Qusi'P)  ■  Since  M  is  stable,  Pv  =  My  is  bounded.  Since  u  is  unbounded 
and  TM~^Pv  is  bounded,  v  =  u  -\-TM~^Pv  is  unbounded.  Hence  {v,Pv)  €  Qub{P)  • 
Similarly,  v  unbounded,  Pv  bounded  imply  y  bounded  and  u  +  J-y  unbounded,  hence  u  is 
unbounded.  This  establishes  part  2)  . 

□ 

Proof  of  Lemma  10 

By  Fact  9  ,  the  stability  of  S(P,Cp)  implies  that  Cp  —  (U  DQ)[V  —  NQ)~^  ,  for 

some  stable  Q  .  Any  input  output  pair  corresponding  to  the  map  Cp  can  be  expressed 

as  the  pair  ((V^  —  NQ)^,{U  +  DQ)()  ,  where  ^  denotes  the  pseudo-state  of  this  particular 
factorization.  For  this  particular  factorization,  the  input  output  pair  of  Cp  is  bounded  if  and 
only  if  ^  is  bounded.  Moreover,  since  T,  M  and  M~^  are  stable,  the  closed-loop  system 
S{V,(CpM  +  P))  is  stable  if  and  only  if  the  closed-loop  map  from  (ui,U2)  to  ^  is  stable. 
Writing  the  equations  describing  the  feedback  system  S{V,  {CpM  +  P))  in  terms  of  we 
obtain  (see  Figure  1  and  equation  (2)) 

DMy  =  N{ui^Py-P{u2+y)-{U  ^DQ)i)  (4) 

iV-NQ)(  =  M(u2  +  y)  .  (5) 

Adding  DM(u2  +  y)  to  both  sides  of  (4),  substituting  (5)  and  (3)  and  using  the  linearity  of 
N  and  D  ,  we  obtain 

(  =  Nui  -  N{P{y  -f  U2)  -  Py)  +  D{M{y  +  U2)  -  My)  . 

Clearly,  if  P  and  M  are  incrementally  stable,  ^  is  bounded  for  all  bounded  Ui  and  U2  .  □ 
Proof  of  Theorem  1 

“if”  Follows  by  Lemma  10. 

“only  if”  Follows  by  Example  11  ,  by  setting  M  =  /,  P  =  l/s,  P  =  (  and  Cp  =  1  . 

□ 

Proof  of  the  claim  in  Example  14 

The  closed  loop  system  S{\ / s,  I  -\-  (•)^)  is  described  by 

X  =  Ui  -  (x  +  U2)  -  (x  +  U2)^  . 

The  closed-loop  system  S(V,  I  (■)^  -f  (•)^)  is  described  by 

X  =  ui  +  —  (x  -f-  U2)  —  (x  4-  1x2)^  —  (x  -|-  1x2)^ 

For  Ui  and  U2  in  Loo  ,  using  the  Lyapunov  function  candidate  V'(x)  =  x^  ,  in  both  cases 
for  |x|  sufficiently  large  V  along  the  solution  trajectories  is  eventually  negative.  Hence  the 
closed-loop  map  from  (ui,U2)  to  (u,x)  is  stable  in  both  cases. 
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